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i860TM 64-Bit Microprocessor 



■ Parallel Architecture that Supports Up 
to Three Operations per Clock 

— One Integer or Control Instruction 
per Clock 

— Up to Two Floating-Point Results per 
Clock 

■ High Performance Design 

— 33.3/40 l\/IHz Clock Rates 

— 80 Peak Single Precision MFLOPs 

— 60 Peak Double Precision MFLOPs 

— 64-Bit External Data Bus 

— 64-Bit Internal Instruction Cache Bus 

— 128-Bit Internal Data Cache Bus 

■ High Level of Integration on One Chip 

— 32-Bit Integer and Control Unit 

— 32/64-Bit Pipelined Floating-Point 
Adder and l\1ultiplier Units 

— 64-Bit 3-D Graphics Unit 

— Paging Unit with Translation 
Lookaside Buffer 

— 4 Kbyte Instruction Cache 

— 8 Kbyte Data Cache 

The Intel i860TM Microprocessor (order codes A80860-33 and A80860-40) delivers supercomputing perform- 
ance in a single VLSI component. The 64-bit design of the 860 microprocessor balances integer, floating point, 
and graphics performance for applications such as engineering workstations, scientific computing, 3-D graph- 
ics workstations, and multiuser systems. Its parallel architecture achieves high throughput with RISC design 
techniques, pipelined processing units, wide data paths, large on-chip caches, million-transistor design, and 
fast one-micron CHMOS IV silicon technology. 

A31-A3 D63-D0 CONTROL 



Compatible with Industry Standards 

— ANSI/IEEE Standard 754-1985 for 
Binary Floating-Point Arithmetic 

— 386TM/486TM Microprocessor Data 
Formats and Page Table Entries 

— JEDEC 168-pin Ceramic Pin Grid 
Array Package (see Packaging 
Outlines and Dimensions, order 
#231369) 

Easy to Use 

— On-Chip Debug Register 

— Assembler, Linker, Simulator, 
Debugger, C and FORTRAN 
Compilers, FORTRAN Vectorizer, 
Scalar and Vector Math Libraries for 
both OS/2'^ and UNIX"^ Environments 



L_U 




Figure 0.1. Block Diagram 

Intel, Intel, 386» 486, i860. Multibus II and Parallel System Bus are trademarks of Intel Corporation. 

*UNIX is a registered trademark of AT&T. OS/2 is a trademark of International Business Maciiines Corporation. 

Intel Corporation assumes no responsibility for the use of any circuitry other than circuitry embodied In an Intel product. No other circuit patent 
licenses are implied. Information contained herein supersedes previously published specifications on these devices from Intel. February 1989 
© Intel Corporation, 1 989 Order Number: 240296-001 
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1.0 FUNCTIONAL DESCRIPTION 

As shown by the block diagram on the front page, 
the 860 microprocessor consists of 9 units: 

1 . Core Execution Unit 

2. Floating-Point Control Unit 

3. Floating-Point Adder Unit 

4. Floating-Point Multiplier Unit 

5. Graphics Unit 

6. Paging Unit 

7. Instruction Cache 

8. Data Cache 

9. Bus and Cache Control Unit 

The core execution unit controls overall operation of 
the 860 microprocessor. The core unit executes 
load, store, integer, bit, and control-transfer opera- 
tions, and fetches instructions for the floating-point 
unit as well. A set of 32 x 32-bit general-purpose 
registers are provided for the manipulation of Integer 
data. Load and store instructions move 8-, 1 6-, and 
32-bit data to and from these registers. Its full set of 
integer, logical, and control-transfer instructions give 
the core unit the ability to execute complete systems 
software and applications programs. A trap mecha- 
nism provides rapid response to exceptions and ex- 
ternal interrupts. Debugging is supported by the abili- 
ty to trap on data or instruction reference. 

The floating-point hardware is connected to a sepa- 
rate set of floating-point registers, which can be 
accessed as 1 6 x 64-blt registers, or 32 x 32-blt reg- 
isters. Special load and store instructions can also 
access these same registers as 8 x 1 28-bit registers. 
All floating-point instructions use these registers as 
their source and destination operands. 

The floating-point control unit controls both the float- 
ing-point adder and the floating-point multiplier, issu- 
ing instructions, handling all source and result 
exceptions, and updating status bits In the floating- 
point status register. The adder and multiplier can 
operate in parallel, producing up to two results per 
clock. The floating-point data types, floating-point in- 
structions, and exception handling all support the 
IEEE Standard for Binary Floating-Point Arithmetic 
(ANSI/IEEE Std 754-1985). 

The floating-point adder performs addition, subtrac- 
tion, comparison, and conversions on 64- and 32-bit 
floating-point values. An adder instruction executes 
in three to four clocks; however, in pipelined mode, a 
new result is generated every clock. 

The floating-point multiplier performs floating-point 
and integer multiply and floating-point reciprocal op- 
erations on 64- and 32-bit floating-point values. A 
multiplier instruction executes in three to four clocks; 



however, in pipelined mode, a new result can be 
generated every clock for single-precision and every 
other clock for double precision. 

The graphics unit has special integer logic that sup- 
ports three-dimensional drawing in a graphics frame 
buffer, with color intensity shading and hidden sur- 
face elimination via the Z-buffer algorithm. The 
graphics unit recognizes the pixel as an 8-, 1 6-, or 
32-bit data type. It can compute individual red, blue, 
and green color intensity values within a pixel; but it 
does so with parallel operations that take advantage 
of the 64-bit Internal word size and 64-bit external 
bus. The graphics features of the 860 microproces- 
sor assume that the surface of a solid object is 
drawn with polygon patches whose shapes approxi- 
mate the original object. The color intensities of the 
vertices of the polygon and their distances from the 
viewer are known, but the distances and intensities 
of the other points must be calculated by interpola- 
tion. The graphics instructions of the 860 microproc- 
essor directly aid such interpolation. 

The paging unit implements protected, paged, virtual 
memory via a 64-entry, four-way set-associative 
memory called the TLB (Translation Lookaside Buff- 
er). The paging unit uses the TLB to perform the 
translation of logical address to physical address, 
and to check for access violations. The access pro- 
tection scheme employs two levels of privilege: user 
and supervisor. 

The instruction cache is a two-way set-associative 
memory of four Kbytes, with 32-byte blocks. It trans- 
fers up to 64 bits per clock (266 Mbyte/sec at 
33.3 MHz). 

The data cache is a two-way set-associative memo- 
ry of eight Kbytes, with 32-byte blocks. It transfers 
up to 128 bits per clock (533 Mbyte/sec at 33.3 
MHz). The 860 microprocessor normally uses write- 
back caching, i.e. memory writes update the cache 
(if applicable) without necessarily updating memory 
immediately; however, caching can be inhibited by 
software where necessary. 

The bus and cache control unit performs data and 
instruction accesses for the core unit. It receives cy- 
cle requests and specifications from the core unit, 
performs the data-cache or instuction-cache miss 
processing, controls TLB translation, and provides 
the interface to the external bus. Its pipelined struc- 
ture supports up to three outstanding bus cycles. 

2.0 PROGRAMMING INTERFACE 

The programmer-visible aspects of the architecture 
of the 860 microprocessor include data types, regis- 
ters, instructions, and traps. 
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2.1 Data Types 

The 860 microprocessor provides operations for in- 
teger and floating-point data. Integer operations are 
performed on 32-bit operands with some support 
also for 64-bit operands. Load and store instructions 
can reference 8-bit, 16-bit, 32-bit, 64-bit, and 128-bit 
operands. Floating-point operations are performed 
on IEEE-standard 32- and 64-bit formats. Graphics 
oriented instructions operate on arrays of 8-, 16-, or 
32-bit pixels. 



2.1.2 ORDINAL 

Arithmetic operations are available for 32-bit ordi- 
nals. An ordinal is an unsigned integer. An ordinal 
can represent values in the range to 
4,294,967,295 ( + 232-1). 

Also, there are add and subtract instructions that op- 
erate on 64-bit ordinals. 



2.1.3 SINGLE- AND DOUBLE-PRECISION REAL 



2.1.1 INTEGER 

An integer is a 32-bit signed value in standard two's 
complement form. A 32-bit integer can represent a 
value in the range -2,147,483,648 (-231) to 
2,147,438,647 ( + 231 - 1). Arithmetic operations on 
8- and 1 6-bit integers can be performed by sign-ex- 
tending the 8- or 16-bit values to 32 bits, then using 
the 32-bit operations. 

There are also add and subtract instructions that op- 
erate on 64-bit long Integers. 

Load and store instructions may also reference (in 
addition to the 32- and 64-bit formats previously 
mentioned) 8- and 1 6-bit items in memory. When an 
8- or 16-bit item is loaded into a register, it is con- 
verted to an integer by sign-extending the value to 
32 bits. When an 8- or 16-bit item Is stored from a 
register, the corresponding number of low-order bits 
of the register are used. 



Figure 2.1 shows the real number formats. A single- 
precision real (also called "single real") data type is 
a 32-bit binary floating-point number. Bit 31 Is the 
sign bit; bits 30..23 are the exponent; and bits 22..0 
are the fraction. In accordance with ANSI /IEEE 
standard 754, the value of a single-precision real is 
defined as follows: 

1 . If e = and i =^ or e = 255 then generate a 
floating-point source-exception trap when en- 
countered in a floating-point operation. 

2. If < e < 255, then the value is -is x 1.f x 

26-127. 

3. If e = and f = 0, then the value is signed zero. 

A double-precision real (also called "double real") 
data type is a 64-bit binary floating-point number. Bit 
63 is the sign bit; bits 62.. 52 are the exponent; and 
bits 51 ..0 are the fraction. In accordance with ANSI/ 
IEEE standard 754, the value of a double-precision 
real is defined as follows: 

1 . If e = and f =7^ or e = 2047, then generate a 
floating-point source-exception trap when en- 
countered in a floating-point operation. 



2. If < e < 2047, then the value is -1s x 

26-1023. 



I.f X 
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Double-Precision Real 



FRACTION 
EXPONENT 
SIGN 



Figure 2.1. Real Number Formats 
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3. If e = and f = 0, then the value is signed zero. 

The special values infinity, NaN ("Not a Number"), 
indefinite, and denormal generate a trap when en- 
countered. The trap handler implements IEEE-stan- 
dard results. 

A double real value occupies an even/odd pair of 
floating-point registers. Bits 31 ..0 are stored in the 
even-numbered floating-point register; bits 63..32 
are stored in the next higher odd-numbered floating- 
point register. 



2.1.4 PIXEL 

A pixel may be 8, 1 6, or 32 bits long depending on 
color and intensity resolution requirements. Regard- 
less of the pixel size, the 860 microprocessor always 
operates on 64 bits worth of pixels at a time. The 
pixel data type is used by two kinds of instructions: 

• The selective pixel-store instruction that helps im- 
plement hidden surface elimination. 

• The pixel add instruction that helps implement 
3-D color intensity shading. 

To perform color intensity shading efficiently in a va- 
riety of applications, the 860 microprocessor defines 
three pixel formats according to Table 2.1. 

Figure 2.2 illustrates one way of assigning meaning 
to the fields of pixels. These assignments are for 
illustration purposes only. The 860 microprocessor 
defines only the field sizes, not the specific use of 
each field. Other ways of using the fields of pixels 
are possible. 



Table 2.1. Pixel Formats 



Pixel 

Size 

(in bits) 



8 
16 
32 



Bits of 
Color 1 
Intensity 



Bits of 
Color 2 
Intensity 



Bits of 
Color 3 
Intensity 



N(^ 8) bits of intensity* 



Bits of 

Other 

Attribute 

(Texture) 



8- N 



The intensity attribute fields may be assigned to colors in 
any order convenient to the application. 

*With 8-bit pixels, up to 8 bits can be used for intensity; the 
remaining bits can be used for any other attribute, such as 
color. The intensity bits must be the low-order bits of the 
pixel. 



2.2 Register Set 

As Figure 2.3 shows, the 860 microprocessor has 
the following registers: 

• An integer register file 

• A floating-point register file 

• Six control registers (psr, epsr, db, dirbase, fir, 
and fsr) 

• Four special-purpose registers (KR, Kl, T, and 
MERGE) 

The control registers are accessible only by load 
and store control-register instructions; the integer 
and floating-point registers are accessed by arithme- 
tic operations and load and store instructions. The 
special-purpose registers KR, Kl, T, and MERGE are 
used by a few specific instructions. 



8-BIT PIXEL 



c 



15 



16-BIT PIXEL 



R 


G 


B 



32-BIT PIXEL 

31 



23 



15 



" 


G 


B 


T 
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I — Intensity, R — Red intensity, G — Green intensity, B — Blue intensity, C — Color, T — Texture 

These assignments of specific meanings to the fields of pixels are for illustration purposes only. Only the field sizes are 

defined, not the specific use of each field. 



Figure 2.2. Pixel Format Example 
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2.2.1 INTEGER REGISTER FILE 

There are 32 integer registers, each 32-bits wide, 
referred to as rO through r31, which are used for 
address computation and scalar integer computa- 
tions. Register rO always returns zero when read, 
independently of what is stored in it. 



2.2.2 FLOATING-POINT REGISTER FILE 

There are 32 floating-point registers, each 32-bits 
wide, referred to as fO through f31, which are used 
for floating-point computations. Registers fO and f 1 
always return zero when read, independently of 
what is stored in them. The floating-point registers 
are also used by a set of integer operations, primari- 
ly for vector integer computations. 

When accessing 64-bit floating-point or integer val- 
ues, the 860 microprocessor uses an even/odd pair 
of registers. When accessing 1 28-bit values, it uses 
an aligned set of four registers (fO, f4, f8, ..., f28). 
The instruction must designate the lowest register 
number of the set of registers containing 64- or 1 28- 
bit values. Misaligned register numbers produce un- 
defined results. The register with the lowest number 
contains the least significant part of the value. For 
1 28-bit values, the register pair with the lower num- 
ber contains the 64 bits at the lowest memory ad- 
dress; the register pair with the higher number con- 
tains the 64 bits at the highest address. 



The 128-bit load and store instructions, along with 
the 128-bit data path between the floating-point reg- 
isters and the data cache help to sustain an extraor- 
dinarily high rate of computation. 

2.2.3 PROCESSOR STATUS REGISTER 

The processor status register (psr) contains miscel- 
laneous state information for the current process. 
Figure 2.4 shows the format of the psr. 

• BR (Break Read) and BW (Break Write) enable a 
data access trap when the operand address 
matches the address in the db register and a 
read or write (respectively) occurs. 

• Various instructions set CC (Condition Code) ac- 
cording to tests they perform. The branch-on- 
condition-code instructions test its value. The bla 
instruction sets and tests LCC (Loop Condition 
Code). 

• IM (Interrupt Mode) enables external interrupts if 
set; disables interrupts if clear. 

• U (User Mode) is set when the 860 microproces- 
sor is executing in user mode; it is clear when the 
860 microprocessor is executing in supervisor 
mode. In user mode, writes to some control regis- 
ters are inhibited. This bit also controls the mem- 
ory protection mechanism. 



10 



inteT 



i860TM MICROPROCESSOR 



APmiKl©! 0K1[F@[^IM1A¥0@M 



EXTERNAL 
MEMORY 




CORE 
UNIT 



...^iP_br^.. . 



64 64 / / 

^ MULTIPLIER UNIT 



FP 
ADDER UNIT 



GRAPHICS UNIT 



Figure 2.3. Registers and Data Paths 
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*Can be changed only from supervisor level. 



Figure 2.4 Processor Status Register 
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(RESERVED) 
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B 
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P 
B 
M 


DCS 


(RESERVED) 


W 
P 


1 
L 


STEPPING 
NUMBER 


PROCESSOR 
TYPE 



- PAGE-TABLE BIT MODE 

- BIG ENDIAN MODE 

- OVERFLOW FLAG 



*Can be changed only from supervisor level 



Figure 2.5 Extended Processor Status Register 



• PIM (Previous Interrupt Mode) and PU (Previous 
User Mode) save the corresponding status bits 
(IM and U) on a trap, because those status bits 
are changed when a trap occurs. They are re- 
stored into their corresponding status bits when 
returning from a trap handler with a branch indi- 
rect instruction when a trap flag is set in the psr. 

• FT (Floating-Point Trap), DAT (Data Access 
Trap), lAT (Instruction Access Trap), IN (Inter- 
rupt), and IT (Instruction Trap) are trap flags. 
They are set when the corresponding trap condi- 
tion occurs. The trap handler examines these bits 
to determine which condition or conditions have 
caused the trap. 



• DS (Delayed Switch) is set if a trap occurs during 
the instruction before dual-Instruction mode is en- 
tered or exited. If DS is set and DIM (Dual Instruc- 
tion Mode) is clear, the 860 microprocessor 
switches to dual-instruction mode one instruction 
after returning from the trap handler. If DS and 
DIM are both set, the 860 microprocessor 
switches to single-Instruction mode one instruc- 
tion after returning from the trap handler. 

• When a trap occurs, the 860 microprocessor sets 
DIM if it is executing in dual-instruction mode; it 
clears DIM if it is executing in single-instruction 
mode. If DIM is set after returning from a trap 
handler, the 860 microprocessor resumes execu- 
tion in dual-instruction mode. 
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• When KNF (Kill Next Floating-Point Instruction) is 
set, the next floating-point instruction is sup- 
pressed (except that its dual-instruction mode bit 
is interpreted). A trap handler sets KNF if the 
trapped floating-point instruction should not be 
reexecuted. 

• SC (Shift Count) stores the shift count used by 
the last right-shift instruction. It controls the num- 
ber of shifts executed by the double-shift instruc- 
tion. 

• PS (Pixel Size) and PM (Pixel Mask) are used by 
the pixel-store instruction and by the vector inte- 
ger instructions. The values of PS control pixel 
size as defined by Table 2.2. The bits in PM cor- 
respond to pixels to be updated by the pixel-store 
instruction pst.d. The low-order bit of PM corre- 
sponds to the low-order pixel of the 64-bit source 
operand of pst.d. The number of low-order bits of 
PM that are actually used is the number of pixels 
that fit into 64-bits, which depends upon PS. If a 
bit of PM is set, then pst.d stores the corre- 
sponding pixel. Refer also to the pstd instruction 
in section 8. 

Table 2.2. Values of PS 



Value 


Pixel Size 
in bits 


Pixel Size 
in bytes 


00 
01 
10 

11 


8 

16 

32 

(undefined) 


1 

2 

4 

(undefined) 



2.2.4 EXTENDED PROCESSOR STATUS 
REGISTER 

The extended processor status register (epsr) con- 
tains additional state information for the current pro- 
cess beyond that stored in the psr. Figure 2.5 shows 
the format of the epsr. 

• The processor type is one for the 860 microproc- 
essor. 

• The stepping number has a unique value that dis- 
tinguishes among different revisions of the proc- 
essor. 

• IL (Interlock) Is set if a trap occurs after a lock 
instruction but before theload or store following 
the subsequent unlock instruction. IL indicates to 
the trap hadnler that a locked sequence has 
been interrupted. 

• WP (write protect) controls the semantics of the 
W bit of page table entries. A clear W bit in either 
the directory or the page table entry causes 
writes to be trapped. When WP is clear, writes 
are trapped in user mode, but not in supervisor 
mode. When WP is set, writes are trapped in both 
user and supervisor modes. 

• DCS (Data Cache Size) is a read-only field that 
tells the size of the on-chip data cache. The num- 
ber of bytes actually available is 212+ DCS; there- 
fore, a value of zero Indicates 4 Kbytes, one indi- 
cates 8 Kbytes, etc. 



ADDRESS TRANSLATION ENABLE 

DRAM PAGE SIZE 

BUS LOCK 

l-CACHE, TLB INVALIDATE 
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CODE SIZE 8-BIT 
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Figure 2.6. Directory Base Register 
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• PBM (Page-Table Bit Mode) determines wiiich bit 
of page-table entries is output on the PTB pin. 
When PBM is clear, the PTB signal reflects bit CD 
of the page-table entry used for the current cycle. 
When PBM is set, the PTB signal reflects bit WT 
of the page-table entry used for the current cycle. 

• BE (Big Endian) controls the ordering of bytes 
within a data item in memory. Normally (i.e. when 
BE is clear) the 860 microprocessor operates in 
little endian mode, in which the addressed byte is 
the low-order byte. When BE is set (bit endian 
mode), the low-order three bits of all load and 
store addresses are complemented, then 
masked to the appropriate boundary for align- 
ment. This causes the addressed byte to be the 
most significant byte. 

• OF (Overflow Flag) Is set by adds, addu, subs, 
and subu when integer overflow occurs. For 
adds and subs, OF is set if the carry from bit 31 
is different than the carry from bit 30. For addu, 
OF is set if there is a carry from bit 31 . For subu, 
OF is set if there is no carry from bit 31 . Under all 
other conditions, it is cleared by these instruc- 
tions. OF controls the function of the intovr 
instruction. 



2.2.5 DATA BREAKPOINT REGISTER 

The data breakpoint register (db) is used to gener- 
ate a trap when the 860 microprocessor makes a 
data-operand access to the address stored in this 
register. The trap is enabled by BR and BW in psr. 
When comparing, a number of low order bits of the 
address are ignored, depending on the size of the 
operand. For example, a 16-bit access ignores the 
low-order bit of the address when comparing to db; 
a 32-bit access ignores the low-order two bits. This 
ensures that any access that overlaps the address 
contained in the register will generate a trap. 



2.2.6 DIRECTORY BASE REGISTER 

The directory base register dirbase (shown in Figure 
2.5) controls address translation, caching, and bus 
options. 

• ATE (Address Translation Enable), when set, en- 
ables the virtual-address translation algorithm. 
The data cache must be flushed before changing 
the ATE bit. 

• DPS (DRAM Page Size) controls how many bits 
to ignore when comparing the current bus-cycle 
address with the previous bus-cycle address to 



generate the NENE# signal. This feature allows 
for higher speeds when using static column or 
page-mode DRAMs and consecutive reads and 
writes access the row. The comparison ignores 
the low-order 1 2 + DPS bits. A value of zero is 
appropriate for one bank of 256K x n RAMs, 1 
for 1 M X A7 RAMS, etc. 

When BL (Bus Lock) is set, external bus access- 
es are locked. The LOCK# signal is asserted the 
next bus cycle whose internal bus request is gen- 
erated after BL is set. It remains set on every 
subsequent bus cycle as long as BL remains set. 
The LOCK# signal is deasserted on the next bus 
cycle whose internal bus request is generated af- 
ter BL is cleared. Traps immediately clear BL. 
The lock and unlock instructions control the BL 
bit. 

ITI (l-Cache, TLB Invalidate), when set in the val- 
ue that is loaded into dirbase, causes the instruc- 
tion cache and address-translation cache (TLB) 
to be flushed. The ITI bit does not remain set in 
dirbase. ITI always appears as zero when read- 
ing dirbase. The data cache must be flushed be- 
fore invalidating the TLB. 

When CS8 (Code Size 8-Bit) is set, instruction 
cache misses are processed as 8-bit bus cycles. 
When this bit is clear, instruction cache misses 
are processed as 64-bit bus cycles. This bit can 
not be set by software; hardware sets this bit at 
initialization time. It can be cleared by software 
(one time only) to allow the system to execute out 
of 64-bit memory after bootstrapping from 8-bit 
EPROM. A nondelayed branch to code in 64-bit 
memory should directly follow the st.c (store con- 
trol register) instruction that clears CS8, in order 
to make the transition from 8-bit to 64-bit memory 
occur at the correct time. The branch must be 
aligned on a 64-bit boundary. 

RB (Replacement Block) identifies the cache 
block to be replaced by cache replacement algo- 
rithms. The high-order bit of RB is ignored by the 
instruction and data caches. RB conditions the 
cache flush instruction flush, which is discussed 
in Section 8. Table 2.3 explains the values of RB. 

RC (Replacement Control) controls cache re- 
placement algorithms. Table 2.4 explains the sig- 
nificance of the values of RC. 

DTB (Directory Table Base) contains the high-or- 
der 20 bits of the physical address of the page 
directory when address translation is enabled (i.e. 
ATE = 1). The low-order 12 bits of the address 
are zeros. 
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Figure 2.7. Floating-Point Status Register 





Table 2.3. Values of RB 


Value 


Replace 
TLB Block 


Replace Instruction 
and Data Cache Block 




1 

1 

1 1 



1 
2 
3 




1 


1 



Table 2.4. Values of RC 



Value 


Meaning 


00 


Selects the normal replacement 
alogrithm where any block in the set 
may be replaced on cache misses in all 
caches. 


01 


Instruction, data, and TLB cache 
misses replace the block selected by 
RB. The instruction and data caches 
ignore the high-order bit of RB. This 
mode is used for instruction cache and 
TLB testing. 


10 


Data cache misses replace the block 
selected by the low-order bit of RB. 


11 


Disables data cache replacement. 



2.2.7 FAULT INSTRUCTION REGISTER 

When a trap occurs, this register contains the ad- 
dress Of the trapping instruction (not necessarily the 
instruction that created the conditions that required 
the trap). 



2.2.8 FLOATING-POINT STATUS REGISTER 

The floating-point status register (fsr) contains the 
floating-point trap and rounding-mode status for the 
current process. Figure 2.6 shows its format. 

• If FZ (Flush Zero) is clear and underflow occurs, 
a result-exception trap is generated. When FZ is 
set and underflow occurs, the result is set to zero, 
and no trap due to underflow occurs. 

• If Tl (Trap Inexact) is clear, inexact results do not 
cause a trap. If Tl is set, inexact results cause a 
trap. The sticky inexact flag (SI) is set whenever 
an inexact result is produced, regardless of the 
setting of Tl. 

• RM (Rounding Mode) specifies one of the four 
rounding modes defined by the IEEE standard. 
Given a true result b that cannot be represented 
by the target data type, the 860 microprocessor 
determines the two representable numbers a and 
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Table 2.5. Values of RM 



Value 


Rounding Mode 


Rounding Action 


00 


Round to nearest or even 


Closer to if? of a or c; if equally 
close, select even number 
(the one whose least 
significant bit is zero). 


01 


Round down (toward - <») 


a 


10 


Round up (toward + <» 


c 


11 


Chop (toward zero) 


Smaller in magnitude of a or c. 



c that most closely bracket b in value {a < b < 
c). The 860 microprocessor then rounds (chang- 
es) bXoaox c according to the mode selected by 
RM as defined in Table 2.5. Rounding introduces 
an error in the result that is less than one least- 
significant bit. 

• The U-bit (Update Bit), If set in the value that is 
loaded into fsr by a st.c instruction, enables up- 
dating of the result-status bits (AE, AA, Al, AO, 
AU, MA, Ml, MO, and MU) in the first-stage of the 
floating-point adder and multiplier pipelines. If this 
bit is clear, the result-status bits are unaffected 
by a st.c Instruction; st.c ignores the correspond- 
ing bits in the value that is being loaded. An st.c 
always updates fsr bits 21.. 17 and 8..0 directly. 
The U-bit does not remain set; it always appears 
as zero when read. 

• The FTE (Floating-Point Trap Enable) bit, if clear, 
disables all floating-point traps (invalid input oper- 
and, overflow, underflow, and inexact result). 

• SI (Sticky Inexact) is set when the last-stage re- 
sult of either the multiplier or adder is inexact (i.e. 
when either Al or Ml is set). SI is "sticky" in the 
sense that it remains set until reset by software. 
Al and Ml, on the other hand, can by changed by 
the subsequent floating-point instruction. 

• SE (Source Exception) is set when one of the 
source operands of a floating-point operation Is 
invalid; it is cleared when all the input operands 
are valid. Invalid input operands include denor- 
mals, infinities, and all NaNs (both quiet and sig- 
naling). 

• When read from the fsr, the result-status bits MA, 
Ml, MO, and MU (Multiplier Add-One, Inexact, 
Overflow, and Underflow, respectively) describe 
the last-stage result of the multiplier. 

When read from the fsr, the result-status bits AA, 
Al, AO, AU, and AE (Adder Add-One, Inexact, 
Overflow, Underflow, and Exponent, respectively) 
describe the last-stage result of the adder. The 
high-order three bits of the 1 1 -bit exponent of the 
adder result are stored in the AE field. 

After a floating-point operation in a given unit (ad- 
der or multiplier), the result-status bits of that unit 
are undefined until the point at which result ex- 
ceptions are reported. 



When written to the fsr with the U-bIt set, the 
result-status bits are placed into the first stage of 
the adder and multiplier pipelines. When the 
processor executes pipelined operations, it prop- 
agates the result-status bits of a particular unit 
(multiplier or adder) one stage for each pipelined 
floating-point operation for that unit. When they 
reach the last stage, they replace the normal re- 
sult-status bits in the fsr. When the U-bit is not 
set, result-status bits in the word being writeen to 
the fsr are ignored. 

In a floating-point dual-operation Instruction (e.g. 
add-and-multiply or subtract-and-multiply), both 
the multiplier and the adder may set exception 
bits. The result-status bits for a particular unit re- 
main set until the next operation that uses that 
unit. 

RR (Result Register) specifies which floating- 
point register (f0-f31) was the destination regis- 
ter when a result-exception trap occurs due to a 
scalar operation. 

LRP (Load Pipe Result Precision), IRP (Vector-In- 
teger Pipe Result Precision), MRP (Multiplier Pipe 
Result Precision), and ARP (Adder Pipe Result 
Precision) aid in restoring pipeline state after a 
trap or process switch. Each defines the preci- 
sion of the last-stage result in the corresponding 
pipeline. One of these bits is set when the result 
In the last stage of the corresponding pipeline is 
double precision; it is cleared if the result is single 
precision. These bits cannot be changed by soft- 
ware. 



2.2.9 KR, Kl, T, AND MERGE REGISTERS 

The KR, Kl, and T registers are special-purpose reg- 
isters used by the dual-operation floating-point 
instructions pfam,/pfmam, and pfmsm, which initi- 
ate both an adder (A-unit) operation and a multiplier 
(M-unit) operation. The KR, Kl, and T registers can 
store values from one dual-operation instruction and 
supply them as Inputs to subsequent dual-operation 
instructions. (Refer to Table 2.9.) 

The MERGE register is used only by the vector-inte- 
ger instructions. The purpose of the MERGE register 
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is to accumulate (or merge) the results of multiple- 
addition operations that use as operands the color- 
intensity values from pixels or distance values from 
a Z-buffer. The accumulated results can then be 
stored in one 64-bit operation. 

Two multiple-addition instructions and an OR in- 
struction use the MERGE register. The addition In- 
structions are designed to add interpolation values 
to each color-intensity field in an array of pixels or to 
each distance value in a Z-buffer. 

Refer to the instruction descriptions in section 8 for 
more information about these registers. 



2.3 Addressing 

Memory is addressed in byte units with a paged vir- 
tual-address space of 232 bytes. Data and instruc- 
tions can be located anywhere in this address 
space. Address arithmetic is performed using 32-bit 
input values and produces 32-bit results. The low-or- 
der 32 bits of the result ae used in case of overflow. 

Normally, multibyte data values are stored in memo- 
ry in little endian format, i.e., with the least significant 
byte at the lowest memory address. As an option 
that may be dynamically selected by software in su- 
pervisor mode, the 860 microprocessor also offers 
big endian mode, in which the most significant byte 
of a data item is at the lowest address. Code ac- 
cesses are always done with little endian address- 
ing. Figure 2.8 shows the difference between the 
two storage modes. Big endian and little endian data 
areas should not be mixed within a 64-bit data word. 
Illustrations of data structures in this data sheet 



show data stored in little endian mode, i.e., the right- 
most (low-order) byte is at the lowest memory ad- 
dress. 

Alignment requirements are as follows (any violation 
results in a data-access trap): 

• 1 28-bit values are aligned on 1 6-byte boundaries 
when referenced in memory (i.e. the four least 
significant address bits must be zero). 

• 64-bit values are aligned on 8-byte boundaries 
when referenced in memory (i.e. the three least 
significant address bits must be zero). 

• 32-bit values are aligned on 4-byte boundaries 
when referenced in memory (i.e. the two least 
significant address bits must be zero). 

• 16-bit values are aligned on 2-byte boundaries 
when referenced in memory (i.e. the least signifi- 
cant address bit must be zero). 



2.4 Virtual Addressing 

When address translation is enabled, the 860 micro- 
processor maps instruction and data virtual address- 
es into physical addresses before referencing mem- 
ory. This address transformation is compatible with 
that of the 386 microprocessor and implements the 
basic features needed for page-oriented virtual- 
memory systems and page-level protection. 

The address translation is optional. Address transla- 
tion is In effect only when the ATE bit of dirbase is 
set. This bit is typically set by the operating system 
during software initialization. The ATE bit must be 
set if the operating system is to implement page-ori- 
ented protection or page-oriented virtual memory. 
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Figure 2.8. Little Big Endian Memory Format 
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Figure 2.9. Format of a Virtual Address 



Address translation is disabled when the processor 
is reset. It is enabled when a store to dirbase sets 
the ATE bit. It is disabled again when a store clears 
the ATE bit. 



2.4.1 PAGE FRAI\/IE 

A page frame is a 4-Kbyte unit of contiguous ad- 
dresses of physical main memory. Page frames be- 
gin on 4-Kbyte boundaries and are fixed in size. A 
page is the collection of data that occupies a page 
frame when that data is present in main memory or 
occupies some location in secondary storage when 
there is not sufficient space in main memory. 

2.4.2 VIRTUAL ADDRESS 

A virtual address refers indirectly to a physical ad- 
dress by specifying a page table, a page within that 



table, and an offset within that page. Figure 2.9 
shows the format of a virtual address. 

Figure 2.9 shows how the 860 microprocessor con- 
verts the DIR, PAGE, and OFFSET fields of a virtual 
address into the physical address by consulting two 
levels of page tables. The addressing mechanism 
uses the DIR field as an index into a page directory, 
uses the PAGE field as an index into the page table 
determined by the page directory, and uses the 
OFFSET field to address a byte within the page de- 
termined by the page table. 



2.4.3 PAGE TABLES 

A page table is simply an array of 32-bit page specifi- 
ers. A page table is itself a page, and therefore con- 
tains 4 Kbytes of memory or at most 1 K 32-bit en- 
tries. 
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Figure 2.10. Address Translation 
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Two levels of tables are used to address a page of 
memory. At the higher level is a page directory. The 
page directory addresses up to 1K page tables of 
the second level. A page table of the second level 
addresses up to 1 K pages. All the tables addressed 
by one page directory, therefore, can address 1M 
pages (220). Because each page contains 4 Kbytes 
(212 bytes), the tables of one page directory can 
span the entire physical address space of the 860 
microprocessor (220 x 212 = 232). 

The physical address of the current page directory is 
stored In DTB field of the dirbase register. Memory 
management software has the option of using one 
page directory for all processes, one page directory 
for each process, or some combination of the two. 

2.4.4 PAGE-TABLE ENTRIES 

Page-table entries (PTEs) in either level of page ta- 
bles have the same format. Figure 2.11 illustrates 
this format. 



2.4.4.2 Present Bit 

The P (present) bit indicates whether a page table 
entry can be used In address translation. P = 1 indi- 
cates that the entry can be used. When P = in 
either level of page tables, the entry is not valid for 
address translation, and the rest of the entry is avail- 
able for software use; none of the other bits in the 
entry is tested by the hardware. If P = In either 
level of page tables when an attempt is made to use 
a page-table entry for address translation, the proc- 
essor signals either a data-access fault or an in- 
struction-access fault. In software systems that sup- 
port paged virtual memory, the trap handler can 
bring the required page into physical memory. 

Note that there is no P bit for the page directory 
itself. The page directory may be not-present while 
the associated process is suspended, but the oper- 
ating system must ensure that the page directory 
indicated by the dirbase image associated with the 
process is present in physical memory before the 
process is dispatched. 



2.4.4.1 Page Frame Address 

The page frame address specifies the physical start- 
ing address of a page. Because pages are located 
on 4K boundaries, the low-order 12 bits are always 
zero. In a page directory, the page frame address is 
the address of a page table. In a second-level page 
table, the page frame address is the address of the 
page frame that contains the desired memory oper- 
and. 



2.4.4.3 Writable and User Bits 

The W (writable) and U (user) bits are used for page- 
level protection, which the 860 microprocessor per- 
forms at the same time as address translation. The 
concept of privilege for pages is implemented by as- 
signing each page to one of two levels: 

1 . Supervisor level (U = 0)— for the operating sys- 
tem and other systems software and related data. 

2. User level (U = 1) — for applications procedures 
and data. 
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Figure 2. 11. Format of a Page Table Entry 
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The U bit of the psr indicates whether the 860 mi- 
croprocessor is executing at user or supervisor level. 
The 860 microprocessor maintains the U bit of psr 
as follows: 

• The 860 microprocessor clears the psr U bit to 
indicate supervisor level when a trap occurs (in- 
cluding when the trap instruction causes the 
trap). The prior value fo U is copied into PL). 

• The 860 microprocessor copies the psr PU bit 
into the U bit when an indirect branch is executed 
and one of the trap bits is set. If PU was one, the 
860 microprocessor enters user level. 

With the U bit of psr and the W and U bits of the 
page table entries, the 860 microprocessor imple- 
ments the following protection rules: 

• When at user level, a read or write of a supervi- 
sor-level pages causes a trap. 

• When at user level, a write to a page whose W bit 
is not set causes a trap. 

• When at user level, st.c to certain control regis- 
ters is ignored. 

When the 860 microprocessor is executing at super- 
visor level, all pages are addressable, but, when it is 
executing at user level, only pages that belong to the 
user-level are addressable. 

When the 860 microprocessor is executing at super- 
visor level, all pages are readable. Whether a page 
is writable depends upon the write-protection mode 
controlled by WP of epsr: 

WP = All pages are writable. 

WP = 1 A write to page whose W bit is not 

set causes a trap. 

When the 860 microprocessor is executing at user 
level, only pages that belong to user level and are 
marked writable are actually writable; pages that be- 
long to supervisor level are neither readable nor wri- 
table from user level. 

2.4.4.4 Write-Through Bit 

The 860 microprocessor does not implement a 
write-through caching policy for the on-chip instruc- 
tion and data caches; however, the WT (write- 
through) bit in the second-level page-table entry 
does determine internal caching policy. If WT is set 
In a PTE, on-chip caching of data from the corre- 
sponding page is inhibited. If WT is clear, the normal 
write-back policy is applied to data from the page in 
the on-chip caches. The WT bit of page directory 
entries is not referenced by the processor, but is 
reserved. 

The WT bit is Independent of the CD bit; therefore, 
data may be placed in a second-level coherent 
cache, but kept out of the on-chip caches. 



2.4.4.5 Cache Disable Bit 

If the CD (cache disable) bit in the second-level 
page-table entry is set, data from the associated 
page is not placed in external instruction or data 
caches. Clearing CD permits the external cache 
hardware to place data from the associated page 
into external caches. The CD bit of page directory 
entries is not referenced by the processor, but is 
reserved. 



2.4.4.6 Accessed and Dirty Bits 

The A (accessed) and D (dirty) bits provide data 
about page usage in both levels of the page tables. 

The 860 microprocessor sets the corresponding ac- 
cessed bits in both levels of page tables before a 
read or write operation to a page. The processor 
tests the dirty bit in the second-level page table be- 
fore a write to an address covered by that page table 
entry, and, under certain conditions, causes traps. 
The trap handler than has the opportunity to main- 
tain appropriate values In the dirty bits. The dirty bit 
in directory entries is not tested by the 860 micro- 
processor. The precise algorithm for using these bits 
Is specified in Subsection 2.4.5. 

An operating system that supports paged virtual 
memory can use these bits to determine what pages 
to eliminate from physical memory when the de- 
mand for memory exceeds the physical memory 
available. The D and A bits in the PTE (page-table 
entry) are normally initialized to zero by the operat- 
ing system. The processor sets the A bit when a 
page is accessed either by a read or write operation. 
When a data- or instruction-access fault occurs, the 
trap handler sets the D bit if an allowable write is 
being performed, then re-executes the instruction. 

The operating system Is responsible for coordinating 
its updates to the accessed and dirty bits with up- 
dates by the CPU and by other processors that may 
share the page tables. The 860 microprocessor au- 
tomatically asserts the LOCK# signal while testing 
and setting the A bit. 

2.4.4.7 Combining Protection of Both Leveis of 
Page Tables 

For any one page, the protection attributes of Its 
page directory entry may differ from those of its 
page table entry. The 860 microprocessor computes 
the effective protection attributes for a page by ex- 
amining the protection attributes in both the directo- 
ry and the page table. Table 2.6 shows the effective 
protection provided by the possible combinations of 
protection attributes. 
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2.4.5 ADDRESS TRANSLATION ALGORITHM 

The algorithm below defines the translation of each 
virtual address to a physical address. Let DIR, 
PAGE, and OFFSET be the fields of the virtual ad- 
dress; let PFA1 and PFA2 be the page frame ad- 
dress fields of the first and second level page tables 
respectively; DTB is the page directly table base ad- 
dress stored in the dirbase register. 

1. Assert LOCK#. 

2. Read the PTE (page table entry) at the physical 
address formed by DTB:DIR:00. 

3. If P in the PTE is zero, generate a data- or instruc- 
tion-access fault. 

4. If W in the PTE is zero, the operation is a write, 
and either the U-bit of the PSR is set or WP = 1 , 
generate a data- or instruction-access fault. 

5. If the U-bit in the PTE is zero and the U-bit in the 
psr is set, generate a data- or instruction-access 
fault. 

6. If A in the PTE is zero, set A. 

7. Locate the PTE at the physical address formed by 
PFA1:PAGE:00. 

8. Perform the P, A, W, and U checks as in steps 3 
through 6 with the second-level PTE. 

9. If D in the PTE is clear and the operation is a 
write, generate a data- or instruction-access fault. 



10. Form the physical address as PFA2:0FFSET. 

11. Deassert LOCK#. 

2.4.6 ADDRESS TRANSLATION FAULTS 

The address translation fault is one instance of the 
data-access fault. The instruction causing the fault 
can be re-executed upon returning from the trap 
handler. 

2.4.7 PAGE TRANSLATION CACHE 

For greatest efficiency in address translation, the 
860 microprocessor stores the most recently used 
page-table data in an on-chip cache called the TLB 
(translation lookaside buffer). Only if the necessary 
paging information is not In the cache must both lev- 
els of page tables be referenced. 

2.5 Caching and Cache Flushing 

The 860 microprocessor has the ability to cache in- 
struction, data, and address-translation information 
in on-chip caches. Caching may use virtual-address 
tags. The effects of mapping two different virtual ad- 
dresses in the same address space to the same 
physical address are undefined. 
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NOTES: 

U = — Supervisor W = — Read only 

U = 1— User W = 1— Read and write 

X indicates that, when the combined U attribute is supervisor 
and WP = 0, the W attribute is not checked. 
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Instruction, data, and address-translation caching on 
the 860 microprocessor are not transparent. Writes 
do not immediately update memory, the TLB, nor the 
instruction cache. Writes to memory by other bus 
devices do not update the caches. Under certain cir- 
cumstances, such as I/O references, self-modifying 
code, page-table updates, or shared data in a multi- 
processing system, it is necessary to bypass or to 
flush the caches. 860 microprocessor provides the 
following methods for doing this: 

• Bypassing Instruction and Data Caches. If 

deasserted during cache-miss processing, the 
KEN# pin disables instruction and data caching 
of the referenced data. If the CD bit from the as- 
sociated second-level PTE is set, caching of data 
and instructions is disabled. The value of the CD 
bit is output on the PTB pin for use by external 
caches. 

• Flushing Instruction and Address-Translation 
Caches. Storing to the dirbase register with the 
IT! bit set invalidates the contents of the instruc- 
tion and address-translation caches. This bit 



should be set when a page table or a page con- 
taining code is modified or when changing the 
DTB field of dirbase. Note that in order to make 
the instruction or address-translation caches con- 
sistent with the data cache, the data cache must 
be flushed before invalidating the other caches. 

NOTE: 

The mapping of the page(s) containing the cur- 
rently executing instruction, the next six instruc- 
tions, and any data referenced by these instruc- 
tions should not be different in the new page 
tables when the DTB is changed. 

Flushing the Data Cache. The data cache is 
flushed by a software routine using the flush in- 
struction. The data cache must be flushed prior to 
flushing the instruction or address-translation 
cache (as controlled by the ITI bit of dirbase) or 
enabling or disabling address translation (via the 
ATE bit). While the cache is being flushed, no 
interrupt or trap routines should be executed that 
load sharable data into the cache. 
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2.6 Instruction Set 

Table 2.7 shows the complete set of instructions 
grouped by function within processing unit. Refer to 
Section 8 for an algorithmic definition of each in- 
struction. 

The architecture of the 860 microprocessor uses 
parallelism to increase the rate at which operations 
may be introduced Into the unit. Parallelism in the 
860 microprocessor is not transparent; rather, pro- 
grammers have complete control over parallelism 
and therefore can achieve maximum performance 
for a variety of computational problems. 



2.6.1 PIPELINED AND SCALAR OPERATIONS 

One type of parallelism used within the floating-point 
unit is "pipelining". The pipelined architecture treats 
each operation as a series of more primitive opera- 
tions (called "stages") that can be executed in par- 
allel. Consider just the floating-point adder unit as an 
example. Let A represent the operation of the adder. 
Let the stages be represented by Ai, A2, and A3. 
The stages are designed such that Ai+ 1 for one ad- 
der instruction can execute in parallel with Aj for the 
next adder instruction. Furthermore, each Aj can be 
executed in just one clock. The pipelining within the 
multiplier and vector-integer units can be described 
similarly, except that the number of stages may be 
different. 

Figure 2.7 illustrates three-stage pipelining as found 
in the floating-point adder (also in the floating-point 
multiplier when single-precision input operands are 
employed). The columns of the figure represent the 
three stages of the pipeline. Each stage holds inter- 
mediate results and also (when introduced into first 
stage by software) holds status information pertain- 
ing to those results. The figure assumes that the In- 
struction stream consists of a series of consecutive 
floating-point instructions, all of one type (i.e. all ad- 
der instructions or all single-precision multiplier in- 
structions). The instructions are represented as i, 
i+ 1, etc. The rows of the figure represent the states 
of the unit at successive clock cycles. Each time a 
pipelined operation is performed, the result of the 
last stage of the pipeline is stored in the destination 
register rdest, the pipeline is advanced one stage, 
and the input operands src1 and src2 are trans- 
ferred to the first stage of the pipeline. 



In the 860 microprocessor, the number of pipeline 
stages ranges from one to three. A pipelined opera- 
tion with a three-stage pipeline stores the result of 
the third prior operation. A pipelined operation with a 
two-stage pipeline stores the result of the second 
prior operation. A pipelined operation with a one- 
stage pipeline stores the result of the prior opera- 
tion. 

There are four floating-point pipelines: one for the 
multiplier, one for the adder, one for the vector-Inte- 
ger unit, and one for floating-point loads. The adder 
pipeline has three stages. The number of stages in 
the multiplier pipeline depends on the precision of 
the source operands in the pipeline; it may have two 
or three stages. The vector-integer unit has one 
stage for all precisions. The load pipeline has three 
stages for all precisions. 

Changing the FZ (flush zero), RM (rounding mode), 
or RR (result register) bits of fsr while there are re- 
sults in either the multiplier or adder pipeline produc- 
es effects that are not defined. 



2.6.1.1 Scalar Mode 

In addition to the pipelined execution mode, the 860 
microprocessor also can execute floating-point in- 
structions in "scalar" mode. Most floating-point in- 
structions have both pipelined and scalar variants, 
distinguished by a bit in the instruction encoding. In 
scalar mode, the floating-point unit does not start a 
new operation until the previous floating-point oper- 
ation is completed. The scalar operation passes 
through all stages of its pipeline before a new opera- 
tion is introduced, and the result is stored automati- 
cally. Scalar mode is used when the next operation 
depends on results from the previous few floating- 
point operations (or when the compiler or program- 
mer does not want to deal with pipelining). 

2.6.1.2 Pipelining Status Information 

Result status information in the fsr consists of the 
AA, AI, AO, AU, and AE bits, in the case of the ad- 
der, and the MA, Ml, MO, and ML) bits, in the case of 
the multiplier. This information arrives at the fsr via 
the pipeline in one of two ways: 
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Table 2.7. Instruction Set 



Core Unit 


l\/lnemonic 


Description 


Load and Store Instructions 


ld.x 

st.x 

fid.y 

pfld.z 

fst.y 

pstd 


Load integer 
Store integer 
F-P load 

Pipelined F-P load 
F-P store 
Pixel store 


Register to Register l\/loves 


ixfr 
fxfr 


Transfer integer to F-P register 
Transfer F-P to integer register 


Integer Arithmetic Instructions 


addu 
adds 
subu 
subs 


Add unsigned 
Add signed 
Subtract unsigned 
Subtract signed 


Shift Instructions 


shI 
shr 
shra 
shrd 


Shift left 

Shift right 

Shift right arithmetic 

Shift right double 


Logical Instructions 


and 

andh 

andnot 

andnoth 

or 

orh 

xor 

xorh 


Logical AND 
Logical AND high 
Logical AND NOT 
Logical AND NOT high 
Logical OR 
Logical OR high 
Logical exclusive OR 
Logical exclusive OR high 


Control-Transfer Instructions 


trap 

intovr 

br 

bri 

be 

bet 

bnc 

bnc.t 

bte 

btne 

bla 

call 

calli 


Software trap 

Software trap on integer overflow 

Branch direct 

Branch indirect 

Branch on CC 

Branch on CC taken 

Branch on not CC 

Branch on not CC taken 

Branch if equal 

Branch if not equal 

Branch on LCC and add 

Subroutine call 

Indirect subroutine call 


System Control Instructions 


flush 

Id.c 

st.c 

lock 

unlock 


Cache flush 

Load from control register 
Store to control register 
Begin interlocked sequence 
End interlocked sequence 



Floating-Point Unit 


l\/lnemonic 


Description 


F-P Multiplier Instruction 


fmul.p 


F-P multiply 


pfmul.p 


Pipelined F-P multiply 


pfmul3.dd 


3-Stage pipelined F-P multiply 


fmlow.p 


F-P multiply low 


frcp.p 


F-P reciprocal 


frsqr.p 


F-P reciprocal square root 


F-P Adder Instructions 


fadd.p 


F-P add 


pfadd.p 


Pipelined F-P add 


fsub.p 


F-P subtract 


pfsub.p 


Pipelined F-P subtract 


pfgt-P 


Pipelined F-P greater-than compare 


pfeq.p 


Pipelined F-P equal compare 


fix.p 


F-P to integer conversion 


pfix.p 


Pipelined F-P to integer conversion 


ftrunc.p 


F-P to integer truncation 


pftrunc.p 


Pipelined F-P to integer truncation 


Dual-Operation Instructions 


pfam.p 


Pipelined F-P add and multiply 


pfsm.p 


Pipelined F-P subtract and multiply 


pfmam 


Pipelined F-P multiply with add 


pfmsm 


Pipelined F-P multiply with subtract 


Long Intege 


jr Instructions 


fisub.z 


Long-integer subtract 


pfisub.z 


Pipelined long-integer subtract 


fiadd.z 


Long-integer add 


pfiadd.z 


Pipelined long-integer add 


Graphics Instructions 


fzchks 


1 6-bit Z-buffer check 


pfzchks 


Pipelined 16-bit Z-buffer check 


fzchkl 


32-bit Z-buffer check 


pfzchkl 


Pipelined 32-bit Z-buffer check 


faddp 


Add with pixel merge 


pfaddp 


Pipelined add with pixel merge 


faddz 


Add with Z merge 


pfaddz 


Pipelined add with Z merge 


form 


OR with MERGE register 


pform 


Pipelined OR with MERGE register 



Assembler Pseudo-Operations 


Mnemonic 


Description 


mov 


Integer register-register move 


fmov.q 


F-P reg-reg move 


pfmov.q 


Pipelined F-P reg-reg move 


nop 


Core no-operation 


fnop 


F-P no-operation 


pfle.p 


Pipelined F-P less-than or equal 
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Figure 2.12. Pipelined Instruction Execution 



1. It is calculated by the last stage of the pipeline. 
This is the normal case. 

2. It is propagated from the first stage of the pipe- 
line. This method is used when restoring the state 
of the pipeline after a preemption. When a store 
instruction updates the fsr and the value of the 
U bit in the word being written into the fsr is set, 
the store updates the result status bits in the first 
stage of both the adder and multiplier pipelines. 
When software changes the result-status bits of 
the first stage of a particular unit (multiplier or ad- 
der), the updated result-status bits are propagat- 
ed one stage for each pipelined floating-point op- 
eration for that unit. In this case, each stage of the 
adder and multiplier pipelines holds its own copy 
of the relevant bits of the fsr. When they reach 
the last stage, they override the normal result- 
status bits computed from the last-stage result. 



At the next floating-point instruction (or at certain 
core instructions), after the result reaches the last 
stage, the 860 microprocessor traps if any of the 
status bits of the fsr indicate exceptions. Note that 
the instruction that creates the exceptional condition 
is not the instruction at which the trap occurs. 

2.6.1.3 Precision in the Pipelines 

In pipelined mode, when a floating-point operation is 
initiated, the result of an earlier pipelined floating- 
point operation is returned. The result precision of 
the current instruction applies to the operation being 
initiated. The precision of the value stored in rdest is 
that which was specified by the instruction that initia- 
ted that operation. 
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Figure 2.13. Dual-Instruction Mode Transitions 



If rdest is the same as src1 or src2, the value being 
stored in rdest Is used as the input operand. In this 
case, the precision of rdest must be the same as the 
source precision. 

The multiplier pipeline has two stages when the 
source operand is double-precision and three stages 
when the precision of the source operand is single. 
This means that a pipelined multiplier operation 
stores the result of the second previous multiplier 
operation for double-precision inputs and third previ- 
ous for single-precision inputs (except when chang- 
ing precisions). 

2.6.1.4 Transition between Scalar and Pipelined 
Operations 

When a scalar operation is executed, it passes 
through all stages of the pipeline; therefore, any un- 
stored results in the affected pipeline are lost. To 
avoid losing information, the last pipelined opera- 
tions before a scalar operation should be dummy 
pipelined operations that unload unstored results 
from the affected pipeline. 



After a scalar operation, the values of ail pipeline 
stages of the affected unit (except the last) are un- 
defined. No spurious result-exception traps result 
when the undefined values are subsequently stored 
by pipelined operations; however, the values should 
not be referenced as source oeprands. 

Fpr best performance a scalar operation should not 
immediately precede a pipelined operation whose 
rdest is nonzero. 



2.6.2 DUAL-INSTRUCTION MODE 

Another form of parallelism results from the fact that 
the 860 microprocessor can execute both a floating- 
point and a core instruction simultaneously. Such 
parallel execution is called [dual-instruction mode]. 
When executing in dual-instruction mode, the in- 
struction sequence consists of 64-bit aligned instruc- 
tions with a floating-point instruction in the lower 32 
bits and a core instruction in the upper 32 bits. Table 
2.6 identifies which instructions are executed by the 
core unit and which by the floating-point unit. 
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Programmers specify dual-instruction mode eitiier 
by including in the mnemonic of a floating-point in- 
struction a d. prefix or by using the Assembler direc- 
tives .dual . . . .enddual. Both of the specifications 
cause the D-bit of floating-point instructions to be 
set. If the 860 microprocessor is executing in single- 
instruction mode and encounters a floating-point in- 
struction with the D-bit set, one more 32-bit instruc- 
tion is executed before dual-mode execution begins. 
If the 860 microprocessor is executing in dual-in- 
struction mode and a floating-point instruction is en- 
countered with a clear D-bit, then one more pair of 
instructions is executed before resuming single-in- 
struction mode. Figure 2.13 illustrates two variations 
of this sequence of events: one for extended se- 
quences of dual-instructions and one for a single in- 
struction pair. 

When a 64-bit dual-instruction pair sequentially fol- 
lows a delayed branch instruction in dual-instruction 
mode, both 32-bit instructions are executed. 

2.6.3 DUAL-OPERATION INSTRUCTIONS 

Special dual-operation floating-point instructions 
(add-and-multiply, subtract-and-multiply) use both 
the multiplier and adder units within the floating- 
point unit in parallel to efficiently execute such com- 
mon tasks as evaluating systems of linear equa- 
tions, performing the Fast Fourier Transform (FFT), 
and performing graphics transformations. 

The Instructions pfam srcl, src2, rdest (a66 and 
multiply), pfsm srcl, src2, rdest (subtract and multi- 
ply), pf mam scr1, src2, rdest (multiply and add), and 
pfmsm src1, src2, rdest (multiply and subtract) initi- 
ate both an adder operation and a multiplier opera- 
tion. Six operands are required, but the instruction 
format specifies only three operands; therefore, 
there are special provisions for specifying the oper- 
ands. These special provisions consist of: 

• Three special registers (KR, Kl, and T), that can 
store values from one dual-operation instruction 
and supply them as inputs to subsequent dual- 
operation instructions. 

1 . The constant registers KR and Kl can store the 
value of src1 and subsequently supply that val- 
ue to the multiplier pipeline in place of src1. 

2. The transfer register T can store the last-stage 
result of the multiplier pipeline and subse- 
quently supply that value to the adder pipeline 
in place of src/. 

• A four-bit data-path control field In the opcode 
(DPC) that specifies the operands and loading of 
the special registers. 

1. Operand-1 of the multiplier can be KR, Kl, or 
src1. 

2. Operand-2 of the multiplier can be src2 or the 
last-stage result of the adder pipeline. 



3. Operand-1 of the adder can be src1, the T-reg- 
ister, or the last-stage result of the adder pipe- 
line. 

4. Operand-2 of the adder can be src2, the last- 
stage result of the multiplier pipeline, or the 
last-stage result of the adder pipeline. 

Figure 2.14 shows all the possible data paths sur- 
rounding the adder and multiplier. A DPC field in 
these instructions select different data paths. Sec- 
tion 8 shows the various encodings of the DPC field. 



SRC1 












SRC2 


RDE 














M ^ 








. .1 








i i . 












i i 






0P1 0P2 








MULTIPLIER UNIT 












RESULT 












1 






^ ^ 




., Tl ii. 


, 






i i 






0P1 0P2 






ADDER UNIT 








RESULT 



















Figure 2.14. Dual-Operation Data Paths 

Note that the mnemonics pfam.p, pfsm.p, 
pfmam.p, and pfmsm.p are never used as such in 
the assembly language; these mnemonics are used 
here to designate classes of related instructions. 
Each value of DPC has a unique mnemonic associ- 
ated with it. 



2.7 Addressing Modes 

Data access is limited to load and store instructions. 
Memory addresses are computed from two fields of 
load and store instructions: s/r/ and src2. 

^. srcl either contains the identifier of a 32-bit inte- 
ger register or contains an immediate 16-bit ad- 
dress offset. 

2. src2 always specifies a register. 
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Table 2.8. Types of Traps 



Type 


Indication 


Caused by 


PSR 


FSR 


Condition 


Instruction 


Instruction 
Fault 


IT 




Software traps 
Missing unlock 


trap, intovr 

Any 


Floating 
Point 
Fault 


FT 


SE 

AO,MO 
AU, MU 
AI.MI 


Floating-point source exception 
Floating-point result exception 

overflow 

underflow 

inexact result 


Any M- or A-unit except f mlow 
Any M- or A-unit except f mlow, pf gt, 
and pfeq. Reported on any F-P 
instruction plus pst, fst, and 
sometimes fid, pfid, ixfr 


Instruction 
Access Fault 


lAT 




Address translation exception 
during instruction fetch 


Any 


Data Access 
Fault 


DAT* 




Load/store address translation 

exception 
Misaligned operand address 
Operand address matches 

db register 


Any load/store 

Any load/store 
Any load/store 


Interrupt 


IN 


External interrupt 


Reset 


No trap bits set 


Hardware RESET signal 



*These cases can be distinguished by examining the operand addresses. 



Because either src1 or src2 may be null (zero), a 
variety of useful addressing modes result: 

offset + register Useful for accessing fields within 
a record, where register points 
to the beginning of the record. 
Useful for accessing items in a 
stack frame, where register is 
r3, the register used for pointing 
to the beginning of the stack 
frame. 

register + register Useful for two-dimensional ar- 
rays or for array access within 
the stack frame. 



register 
offset 



Useful as the end result of any 
arbitrary address calculation. 

Absolute address into the first 
64K of the logical address 
space. 



In addition, the floating-point load and store instruc- 
tions may select autoincrement addressing. In this 
mode src2 is replaced by the sum of src1 and src2 
after performing the load or store. This mode makes 
stepping through arrays more efficient, because it 
eliminates one address-calculation instruction. 



2.8 Interrupts and Traps 

Traps are caused by exceptional conditions detect- 
ed in programs or by external interrupts. Traps 



cause interruption of normal program flow to exe- 
cute a special program known as a trap handler. 
Traps are divided into the types shown in Table 2.8. 



2.8.1 TRAP HANDLER INVOCATION 

This section applies to traps other than reset. When 
a trap occurs, execution of the current instruction is 
aborted. The instruction is restartable. The proces- 
sor takes the following steps while transferring con- 
trol to the trap handler: 

1 . Copies U (user mode) of the psr into PU (previous 
U). 

2. Copies IM (interrupt mode) into PIM (previous IM). 

3. Sets U to zero (supervisor mode). 

4. Sets IM to zero (interrupts disabled). 

5. If the processor is in dual instruction mode, it sets 
DIM; otherwise it clears DIM. 

6. If the processor is in single-instruction mode and 
the next instruction will be executed in dual- 
instruction mode or if the processor is in dual-in- 
struction mode and the next instruction will be 
executed in single-instruction mode, DS is set; 
otherwise, it is cleared. 

7. The appropriate trap type bits in psr are set (IT, 
IN, lAT, DAT, FT). Several bits may be set if the 
corresponding trap conditions occur simulta- 
neously. 

8. An address is placed in the fault instruction regis- 
ter (fir) to help locate the trapped instruction. In 
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single-instruction mode, the address in fir is tiie 
address of the trapped instruction itself. In dual-in- 
struction mode, the address in fir Is that of the 
floating-point half of the dual instruction. If an in- 
struction or data access fault occurred, the asso- 
ciated core instruction is the high-order half of the 
dual Instruction (fir + 4). In dual-instruction 
mode, when a data access fault occurs in the ab- 
sence of other trap conditions, the floating-point 
half of the dual instruction will already have been 
executed (except in the case of the fxfr 
instruction). 

The processor begins executing the trap handler by 
transferring execution to address OxFFFFFFOO. The 
trap handler begins execution in single-instruction 
mode. The trap handler must examine the trap-type 
bits in psr (IT, IN, I AT, DAT, FT) to determine the 
cause or causes of the trap. 

2.8.2 INSTRUCTION FAULT 

This fault is caused by any of the following condi- 
tions. In all cases the processor sets thB IT bit be- 
fore entering the trap handler. 

• By the trap instruction. 

• By the intovr instruction. The trap occurs only if 
OF in epsr is set when intovr is executed. The 
trap handler should clear OF before returning. 

• By the lack of an unlock instruction within 32 in- 
structions of a lock. In this case IL is also set. 
When the trap handler finds IL set, it should scan 
backwards for the lock instruction and restart at 
that point. The absence of a lock instruction with- 
in 32 instructions of the trap indicates a program- 
ming error. 

2.8.3 FLOATING-POINT FAULT 

The floating-point fault occurs on floating-point in- 
structions pst, fst, and sometimes fid, pfid, ixfr. 
The floating-point faults of the 860 microprocessor 
support the floating-point exceptions defined by the 
IEEE standard as well as some other useful classes 
of exceptions. The 860 microprocessor divides 
these Into two classes: source exceptions and result 
exceptions. The numerics library supplied by Intel 
provides the IEEE standard default handling for all 
these exceptions. 

2.8.3.1 Source Exception Faults 

All exceptional operands, including infinities, denor- 
mallzed numbers and NaNs, cause a floating-point 
fault and set SE in the fsr. Source exceptions are 
reported on the instruction that initiates the opera- 
tion. For pipelined operations, the pipeline is not ad- 
vanced. 



The SE value Is undefined for faults on fid, pfId, fst, 
pst, and ixfr Instructions when in single-instruction 
mode or when in dual-instruction mode and the com- 
panion Instruction Is not a multiplier or adder opera- 
tion. 

2.8.3.2 Result Exception Faults 

The class of result exceptions Includes any of the 
following conditions: 

• Overflow. The absolute value of the rounded 
true result would exceed the largest positive finite 
number in the destination format. 

• Underflow (when FZ is clear). The absolute val- 
ue of the rounded true result would be smaller 
than the smallest positive finite number in the 
destination format. 

• Inexact result (when Tl is set). The result is not 
exactly representable in the destination format. 
For example, the fraction Vs cannot be precisely 
represented in binary form. This exception occurs 
frequently and indicates that some (generally ac- 
ceptable) accuracy has been lost. 

The point at which a result exception is reported de- 
pends upon whether pipelined operations are being 
used: 

• Scalar (nonpipelined) operations. Result ex- 
ceptions are reported on the next floating-point, 
fsLx, or pst.x (and sometimes fid, pfId, ixfr) in- 
struction after the scalar operation. When a trap 
occurs, the last-stage of the affected unit con- 
tains the result of the scalar operation. 

• Pipelined operations. Result exceptions are re- 
ported when the result is in the last stage and the 
next floating-point (and sometimes fid, pfId, ixfr) 
instruction is executed. When a trap occurs, the 
pipeline is not advanced, and the last-stage re- 
sults (that caused the trap) remain unchanged. 

When no trap occurs (either because FTE is clear or 
because no exception occurred), the pipeline is ad- 
vanced normally by the new floating-point operation. 
The result-status bits of the affected unit are unde- 
fined until the point that result exceptions are report- 
ed. At this point, the last-stage result-status bits (bits 
29..22 and 16..9 of the fsr) reflect the values in the 
last stages of both the adder and multiplier. For ex- 
ample, if the last-stage result in the multiplier has 
overflowed and a pipelined floating-point pfadd is 
started, a trap occurs and MO is set. 

For scalar operations, the RR bits of fsr specify the 
register in which the result was stored. RR is updat- 
ed when the scalar instruction is initiated. The trap, 
however, occurs on a subsequent instruction. Pro- 
grammers must prevent intervening stores to fsr 
from modifying the RR bits. Prevention may take one 
of the following forms: 
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• Before any store to fsr when a result exception 
may be pending, execute a dummy floating-point 
operation to trigger the result-exception trap. 

• Always read from fsr before storing to it, and 
mask updates so that the RR bits are not 
changed. 

For pipelined operations, RR is cleared; the result is 
in the pipeline of the appropriate unit. 

In either case, the result has the same fraction as 
the true result and has an exponent which is the low- 
order bits of the true result. The trap handler can 
inspect the result, compute the result appropriate for 
that instruction (a NaN or an infinity, for example), 
and store the correct result. The result is either 
stored in the register specified by RR (if nonzero) or 
in the last stage of the pipeline (if RR = 0). The trap 
handler must clear the result status for the last 
stage, then reexecute the trapping instruction. 

Result exceptions may be reported for both the ad- 
der and multiplier units at the same time. In this 
case, the trap handler should fix up the last stage of 
both pipelines. 

2.8.4 INSTRUCTION ACCESS FAULT 

This trap results from a page-not-present exception 
during instruction fetch. If a supervisor-level page Is 
fetched in user mode, an exception may or may not 
occur. 



2.8.5 DATA ACCESS FAULT 

This trap results from an abnormal condition detect- 
ed during data operand fetch or store. Such an ex- 
ception can be due only to one of the following caus- 
es: 

• An attempt is being made to write to a page 
whose D-bit is clear. 

• A memory operand is misaligned (is not located 
at an address that Is a multiple of the length of 
the data). 

• The address stored in the debug register is equal 
to one of the addresses spanned by the operand. 

• The operand is in a not-present page. 

• An attempt is being made from user level to write 
to a read-only page or to access a supervisor-lev- 
el page. 

2.8.6 INTERRUPT TRAP 

An Interrupt is an event that Is signaled from an ex- 
ternal source. If the processor is executing with in- 



terrupts enabled (IM set in the psr), the processor 
sets the interrupt bit IN in the psr, and generates an 
interrupt trap. Vectored interrupts are implemented 
by Interrupt controllers and software. 

2.8.7 RESET TRAP 

When the 860 microprocessor is reset, execution 
begins in single-instruction mode at address 
OxFFFFFFOO. This is the same address as for other 
traps. The reset trap can be distinguished from other 
traps by the fact that no trap bits are set. The in- 
struction cache is flushed. The bits DPS, BL, and 
ATE in dirbase are cleared. CS8 is initialized by the 
value at the INT pin at the end of reset. The bits U, 
IM, BR, and BW in psr are cleared. All other bits of 
psr and all other register contents are undefined. 

The software must ensure that the data cache is 
flushed and control registers are properly initialized 
before performing operations that depend on the 
values of the cache or registers. 

Reset code must initialize the floating-point pipeline 
state to zero with floating-point traps disabled to en- 
sure that no spurious floating-point traps are gener- 
ated. 

After a RESET the 860 microprocessor starts execu- 
tion at supervisor level (U = 0). Before branching to 
the first user-level instruction, the RESET trap han- 
dler or subsequent initialization code has to set PU 
and a trap bit so that an indirect branch instruction 
will copy PU to U, thereby changing to user level. 



2.9 Debugging 

The 860 microprocessor supports debugging with 
both data and instruction breakpoints. The features 
of the 860 architecture that support debugging In- 
clude: 

• db (data breakpoint register) which permits speci- 
fication of a data addresses that the 860 micro- 
processor will monitor. 

• BR (break read) and BW (break write) bits of the 
psr, which enable trapping of either reads or 
writes (respectively) to the address in db. 

• DAT (data access trap) bit of the psr, which al- 
lows the trap handler to determine when a data 
breakpoint was the cause of the trap. 

• trap Instruction that can be used to set break- 
points in code. Any number of code breakpoints 
can be set. The values of the srcl and src2 fields 
help identify which breakpoint has occurred. 

• IT (Instruction trap) bit of the psr, which allows 
the trap handler to determine when a trap 
instruction was the cause of the trap. 
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3.0 HARDWARE INTERFACE 

In the following description of hardware interface, 
the # symbol at the end of a signal name indicates 
that the active or asserted state occurs when the 
signal is at a low voltage. When no # is present after 
the signal name, the signal is asserted when at the 
high voltage level. 



3.1 Signal Description 

Table 3.1 identifies functional groupings of the pins, 
lists every pin by its identifier, gives a brief descrip- 
tion of its function, and lists some of its characteris- 
tics. All output pins are tristate, except HLDA and 
BREQ. All inputs are synchronous, except HOLD 
and INT. 



3.1.1 CLOCK (CLK) 

The CLK input determines execution rate and timing 
of the S60 microprocessor. Timing of other signals is 
specified relative to the rising edge of this signal. 
The 860 microprocessor can utilize a clock rate of 
33.3 MHz. The Internal operating frequency Is the 
same as the external clock. This signal is TTL com- 
patible. 



3.1.2 SYSTEM RESET (RESET) 

Asserting RESET for at least 16 CLK periods causes 
initialization of the 860 microprocessor. Refer to 
section 3.2 "Initialization" for more details related to 
RESET. 



3.1.3 BUS HOLD (HOLD) AND BUS HOLD 
ACKNOWLEDGE (HLDA) 

These pins are used for 860 microprocessor bus ar- 
bitration. At some time after the HOLD signal is as- 
serted, the 860 microprocessor releases control of 
the local bus and puts all bus interface outputs (ex- 
cept BREQ and HLDA) in floating state, then asserts 
HLDA — all during the same clock period. It main- 
tains this state until HOLD is deasserted. Instruction 
execution stops only if required instructions or data 
cannot be read from the on-chip instruction and data 
caches. 

The time required to acknowledge a hold request is 
one clock plus the number of clocks needed to finish 
any outstanding bus cycles. HOLD is recognized 
even while RESET is asserted. 

When leaving a bus hold, the 860 microprocessor 
deactivates HLDA and, in the same clock period, ini- 
tiates a pending bus cycle, if any. 

Hold is an asynchronous input. 
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Table 3.1. Pin Summary 



Pin 
Name 


Function 


Active 
State 


Input/ 
Output 


Execution Control Pins 


CLK 

RESET 

HOLD 

HLDA 

BREQ 

INT/CS8 


CLOCK 

System reset 

Bus hold 

Bus hold acknowledge 

Bus request 

Interrupt, code-size 


High 
High 
High 
High 
High 


1 
1 

1 




1 


Bus Interface Pins 


A31-A3 

BE7#-BE0# 

D63~D0 

LOCK# 

W/R# 

NENE# 

NA# 

READY# 

ADS# 


Address bus 

Byte Enables 

Data bus 

Bus lock 

Write/ Read bus cycle 

NExt NEar 

Next Address request 

Transfer Acknowledge 

ADdress Status 


High 
Low 
High 
Low 
Hi/Low 
Low 
Low 
Low 
Low 




o 

I/O 




1 

1 




Cache Interface Pins 


KEN# 
PTB 


Cache ENable 
Page Table Bit 


Low 
High 


1 




Testability Pins 


SHI 

BSCN 

SCAN 


Boundary Scan Shift Input 
Boundary Scan Enable 
Shift Scan Path 


High 
High 
High 


1 

1 
1 


Intel-Reserved Configuration Pins 


CC1-CC0 


Configuration 


High 


1 


Power and Ground Pins 


Vcc 
Vss 


System power 
System ground 







A # after a pin name indicates that the signal is active when at the low voltage level. 



3.1.4 BUS REQUEST (BREQ) 

This signal is asserted when the 860 microprocessor 
has a pending memory request, even when HLDA is 
asserted. This allows an external bus arbiter to im- 
plement an "on demand only" policy for granting the 
bus to the 860 microprocessor. 



3.1.5 INTERRUPT/CODE-SIZE (INT/CS8) 

This input allows interruption of the current instruc- 
tion stream. If interrupts are enabled (IM set in psr) 
when INT is asserted, the 860 microprocessor fetch- 
es the next instruction from address OxFFFFFFOO. 
To assure that an interrupt is recognized, INT should 
remain asserted until the software acknowledges 
the interrupt (by writing, for example, to a memory- 



mapped port of an interrupt controller). The maxi- 
mum time between the assertion of INT and execu- 
tion of the first instruction of the trap handler is 1 
clocks, plus the time for eight nonpipelined read cy- 
cles (four TLB misses), plus the time for eight non- 
pipelined writes (updates to the A bit), plus the time 
for three sets of four pipelined read cycles and two 
sets of four pipelined writes (instruction and data 
cache misses and write-back cycles to update mem- 
ory). 

If INT is asserted during the clock before the falling 
edge of RESET, the eight-bit code-size mode is se- 
lected. For more about this mode, refer to section 
3.2 "Initialization". 

INT is an asynchronous Input. 
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3.1.6 ADDRESS PINS (A31-A3) AND BYTE 
ENABLES (BE7#-BE0#) 

The 29-bit address bus (A31 -A3) identifies address- 
es to a 64-bit location. Separate byte-enable signals 
(BE7#-BE0#) identify which bytes should be ac- 
cessed within the 64-bit location. Cache reads 
should return 64 bits without regard for the byte-en- 
able signals. 

Instruction fetches (W/R# is low) are distinguished 
from data accesses by the unique combinations of 
BE7#-BE0# defined in Table 3.2. For an eight-bit 
code fetch in eight-bit code-size (CS8) mode, 
BE2#-BE0# are redefined to be A2-A0 of the ad- 
dress. In this case BE7#-BE3# form the code 
shown in Table 3.2 that identifies an instruction 
fetch. 

3.1.7 DATA PINS (D63-D0) 

The bus interface has 64 bidirectional data pins 
(D63-D0) to transfer data in eight- to 64-bit quanti- 
ties. Pins D7-D0 transfer the least significant byte; 
pins D63-D56 transfer the most significant byte. 

In write bus cycles, the point at which data is driven 
onto the bus depends on the type of the preceding 
cycle. If there was no preceding cycle (i.e. the bus 
was idle), data is driven with the address. If the pre- 
ceding cycle was a write, data is driven as soon as 
READY # is returned from the previous cycle. If the 
preceding cycle was a read, data is driven one clock 
after READY # is returned from the previous cycle, 
thereby allowing time for the bus to be turned 
around. 

3.1.8 BUS LOCK (LOCK#) 

This signal is used to provide atomic (indivisible) 
read-modify-write sequences in multiprocessor sys- 
tems. Once the external bus arbiter has accepted a 
memory access for a locked bus cycle from the 860 
microprocessor, it should not accept locked cycles 
(or any cycles, depending on software convention) 
from other bus masters until LOCK# is deasserted. 

The 860 microprocessor coordinates the external 
LOCK# signal with the software-controlled BL bit of 



the dirbase register. Programmers do not have to 
be concerned about the fact that bus activity is not 
always synchronous with instruction execution. 
LOCK# is asserted with ADS# for the first bus cycle 
that results from an instruction executed after the BL 
bit is set. Even if the BL bit is changed between the 
time that an instruction generates an internal bus 
request and the time that the cycle appears on the 
bus, the 860 microprocessor still asserts LOCK# for 
that bus cycle. LOCK# is deasserted with ADS# for 
the next bus cycle that results from an instruction 
executed after the BL bit is cleared. 

The 860 microprocessor also asserts LOCK# during 
TLB miss processing for updates of the accessed bit 
in page-table entries. The maximum time that 
LOCK# can be asserted in this case is five clocks 
plus the time required by software to perform a read- 
modify-write sequence. 

The 860 microprocessor does not acknowledge bus 
hold requests while LOCK# is asserted. 

3.1.9 WRITE/READ BUS CYCLE (W/R#) 

This pin specifies whether a bus cycle is a read 
(LOW) or write (HIGH) cycle. 

3.1.10 NEXT NEAR (NENE#) 

This signal allows higher-speed reads and writes in 
the case of consecutive reads and writes that ac- 
cess static column or page-mode DRAMs. The 860 
microprocessor asserts NENE# when the current 
address is in the same DRAM page as the previous 
bus cycle. The 860 microprocessor determines the 
DRAM page size by inspecting the DPS field in the 
dirbase register. The page size can range from 29 to 
216 64-bit words, supporting DRAM sizes from 256K 
X 1, 256K X 4, and up. NENE# is never asserted 
on the next bus cycle after HLDA is deasserted. 

3.1.11 NEXT ADDRESS REQUEST (NA#) 

NA# makes address pipelining possible. The sys- 
tem asserts NA# to indicate that it is ready to ac- 



Table 3.2. Identifying Instruction Fetches 



Code 
Fetch 


A2 


BE7# 


BE6# 


BE5# 


BE4# 


BE3# 


BE2# 


BE1# 


BEO# 


Normal 
(Non-CS8) 





1 


1 


1 


1 


1 





1 





Normal 
(Non-CS8) 


1 


1 





1 





1 


1 


1 


1 


CS8 
Mode 


X 


1 





1 





X 


Low-order address bits 



33 



iiteT 



I860TM MICROPROCESSOR 



A©mM©i 0[}!OIF@[^IM1A¥0@M 



cept the next address from the 860 microprocessor. 
NA# may be asserted before the current cycle 
ends. (If the system does not implement pipelining, 
NA# does not have to be activated.) The 860 micro- 
processor samples NA# every clock, starting one 
clock after the prior activation of ADS#. When NA# 
is active, the 860 microprocessor is free to drive ad- 
dress and bus-cycle definition for the next pending 
bus cycle. The 860 microprocessor remembers that 
NA# was asserted when no internal request is 
pending; therefore, NA# can be deactivated after 
the next rising edge of the CLK signal. Up to three 
bus cycles can be outstanding simultaneously on 
the processor's bus. 

3.1.12 TRANSFER ACKNOWLEDGE (READY#) 

The system asserts the READY # signal during read 
cycles when valid data is on the data pins and during 
a write cycles when the system has accepted data 
from the data pins. READY # is sampled one clock 
after prior ADS# or prior READY # in case of pipe- 
lining. 

3.1.13 ADDRESS STATUS (ADS#) 

The 860 microprocessor asserts ADS# during the 
first clock of each bus cycle to identify the clock 
period during which it begins to assert outputs on 
the address bus. This signal is not held active during 
a pipelined bus cycle. This allows two-level pipelin- 
ing, for a maximum of three outstanding cycles. 

3.1.14 CACHE ENABLE (KEN#) 

The 860 microprocessor samples KEN# to deter- 
mine whether the data being read for the current 
cache-miss cycle is to be cached. This pin is inter- 
nally NORed with the PTB pin to control cache abili- 
ty on a page by page basis (refer to Table 3.3). 

If the address in one that is permitted to be in the 
cache, KEN # must be continuously asserted during 
the sampling period starting from the clock after 
ADS# is asserted, through the clock NA# or 
READY # is asserted. The entire 64-bit of the data 
bus will be used for the read, regardless of the state 
of the byte-enable pins. Three additional 64-bit bus 
cycles will generate to fill the rest of the 32-byte 
cache block. KEN# must continue to be asserted 
for each of these cycles as well. 

If KEN # is found deasserted at any time during the 
above-described sampling period, the data being 
read will not cached and two scenarios can occur: 1 ) 
if the cycle is due to data-cache miss, no subse- 
quent cache-fill cycles will be generated; 2) if the 
cycle is due to an instruction-cache miss, additional 
cycle(s) will be generated until the address reaches 
a 32-byte boundary. 



3.1.15 PAGE TABLE BIT (PTB) 

Depending on the setting of the PBM (page-table bit 
mode) bit of the epsr, the PTB reflects the value of 
either the CD (cache disable) bit or the WT (write 
through) bit of the page-table entry used for the cur- 
rent cycle. This pin is internally NORed with the 
KEN# pin to control cacheabillty on a page by page 
basis. Table 3.3 shows the relationship between 
PTB and KEN#. When paging is disabled, PTB re- 
mains inactive. 

Table 3.3. Cacheabillty based on KEN# and PTB 



PTB 


KEN# 


Meaning 




1 
1 




1 


1 


Cacheable access 
Noncacheable access 
Noncacheable page 
Noncacheable page 



3.1.16 BOUNDARY SCAN SHIFT INPUT (SHI) 

This pin is used with the testability features. Refer to 
section 3.4. 



3.1.17 BOUNDARY SCAN ENABLE (BSCN) 

This pin is used with the testability features. Refer to 
section 3.4. 



3.1.18 SHIFT SCAN PATH (SCAN) 

This pin is used with the testability features. Refer to 
section 3.3. 



3.1.19 CONFIGURATION (CC1-CC0) 

These two pins are reserved by Intel. Strap both pins 
LOW. 



3.1.20 SYSTEM POWER (Vcc) AND GROUND 
(Vss) 

The 860 microprocessor has 48 pins for power and 
ground. All pins must be connected to the appropri- 
ate low-inductance power and ground signals in the 
system. 



3.2 Initialization 

Initialization of the 860 microprocessor is caused by 
assertion of the RESET signal for at least 16 clocks. 
Table 3.4 shows the status of output pins during the 
time that RESET is asserted. Note that HOLD re- 
quests are honored during RESET and that the 
status of output pins depends on whether a HOLD 
request is being acknowledged. 
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Table 3.4. Output Pin Status during Reset 



Pin Name 


Pin Value 


HOLD 

Not 

Acknowledged 


HOLD 
Acknowledged 


ADS#,LOCK# 


HIGH 


Tri-State OFF 


W/R#, PTB 


LOW 


Tri-State OFF 


BREQ 


LOW 


LOW 


HLDA 


LOW 


HIGH 


D63-D0 


Tri-State OFF 


Tri-State OFF 


A31-A3, 

BE7#-BE0#, 

NENE# 


Undefined 


Tri-State OFF 



After a reset, the 860 microprocessor begins execut- 
ing at address OxFFFFFFOO. The program-visible 
state of the 860 microprocessor after reset is de- 
tailed in section 2. 

Eight-bit code-size mode is selected when INT is as- 
serted during the clock before the falling edge of 
RESET. While in eight-bit code-size mode, instruc- 
tion cache misses are byte reads (transferred on D7- 
DO of the data bus) instead of eight-byte reads. This 
allows the 860 microprocessor to be bootstrapped 
from an eight-bit EPROM. For these code reads, 
byte enables BE2#-BE0# are redefined to be the 
low order three bits of the address, so that a com- 
plete byte address is available. These reads update 
the instruction cache if KEN# is asserted (refer to 
section 3.1.1.4) and are not pipelined even if NA# is 
asserted. While in this mode, instructions must re- 
side in an eight-bit wide memory, while data must 
reside in a separate 64-bit wide memory. After the 
code has been loaded into 64-bit memory, initializa- 
tion code can initiate 64-bit code fetches by clearing 
the CS8 bit of the dirbase register (refer to section 
2). Once eight-bit code-size mode is disabled by 
software, it cannot be reenabled except by resetting 
the 860 microprocessor. 



3.3 Testability 

The 860 microprocessor has a boundary scan mode 
that may be used in component- or board-level test- 
ing to test the signal traces leading to and from the 
860 microprocessor. Boundary scan mode provides 
a simple serial interface that makes it possible to 
test all signal traces with only a few probes. Probes 
need be connected only to CLK, BSCN, SCAN, SHI, 
and BREQ. 

The pins BSCN and SCAN control the boundary 
scan mode (refer to Table 3.5). When BSCN is as- 



serted, the 860 microprocessor enters boundary 
scan mode on the next rising clock edge. Boundary 
scan mode can be activated even while RESET is 
active. When BSCN is deasserted while in boundary 
scan mode, the 860 microprocessor leaves bounda- 
ry scan mode on the next rising clock edge. After 
leaving boundary scan mode, the internal state is 
undefined; therefore, RESET should be asserted. 





Table 3.5. Test Mode Selection 


BSCN 


SCAN 


Testability Mode 


LO 


LO 


No testability mode selected 


LO 


HI 


(Reserved for Intel) 


HI 


LO 


Boundary scan mode, normal 


HI 


HI 


Boundary scan mode, shift 
SHI as input; BREQ as 
output 



For testing purposes, each signal pin has associated 
with it an internal latch. Table 3.6 indentifies these 
latches by name and classifies them as input, out- 
put, or control. The input and output latches carry 
the name of the corresponding pins. 

Table 3.6. Test Mode Latches 



Input 
Latch 


Output 
Latch 


Associated 

Control 

Latch 


SHI 






BSCN 






SCAN 






RESET 






D0-D63 


D0-D63 


DATAt 


CC1-CC0 








A31-A3 


ADDRt 




NENE# 


NENEt 




PTB# 


PTBt 




W/R# 


W/Rt 




ADS# 


ADSt 




HLDA 






LOCK# 


LOCKt 


READY# 






KEN# 






NA# 






INT/CS8 






HOLD 








BE7#-BE0# 


BEt 




BREQ 





Within boundary scan mode the 860 microprocessor 
operates in one of two submodes: normal mode or 
shift mode, depending on the value of the SCAN 
input. A typical test sequence is . . . 
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1. Enter shift mode to assign values to the latches 
that correspond with the pins. 

2. Enter normal mode. In normal mode the 860 mi- 
croprocessor transfers the latched values to the 
output pins and latches the values that are being 
driven onto the input pins. 

3. Reenter shift mode to read the new values of the 
input pins. 

3.3.1 NORMAL MODE 

When SCAN is deasserted, the normal mode is se- 
lected. For each input pin (RESET, HOLD, 
INT/CS8, NA#, READY#, KEN#, SHI, BSCN, 
SCAN, CC1, and CCO), the corresponding latch is 
loaded with the value that is being driven onto the 
pin. 

The tristate output pins (A31-A3, BE7#-BE0#, 
W/R#, NENE#, ADS#, LOCK#, and PTB) are en- 
abled by the control latches ADDRt (for A31-A3), 
BEt, W/Rt, NENEt, ADSt, LOCKt, and PTBt. If a con- 
trol latch is set, the corresponding output latches 
drive their output pins; otherwise the pins are not 
driven. 

The I/O pins (D63-D0) are enabled by the control 
latch DATAt, which is similar to the other control 
latches. In addition, when DATAt Is not set, the data 
pins are treated as input pins and their values are 
latched. 



A tester causes entry into this mode for one of two 
purposes: 

1 . To assign values to output latches to be driven 
onto output pins upon subsequent entry Into nor- 
mal mode. 

2. To read the values of input pins previously latched 
in normal mode. 



4.0 BUS OPERATION 

A bus cycle begins when ADS# is activated and 
ends when READY# is sampled active. READY# is 
sampled one clock after assertion of ADS# and 
thereafter until it becomes active. New cycles can 
start as often as every other clock until three cycles 
are outstanding. A bus cycle Is considered outstand- 
ing as long as READY # has not been asserted to 
terminate that cycle. After READY # becomes ac- 
tive. It is not sampled again for the following (out- 
standing) cycle until the second clock after the one 
during which it became active. READY # is assumed 
to be inactive when it is not sampled. 

With regard to how a bus cycle is generated by the 
860 microprocessor, there are two types of cycles: 
pipelined and nonpipelined. Both types of cycles can 
be either read or write cycles. A pipelined cycle is 
one that starts while one or two other bus cycles are 
outstanding. A nonpipelined cycle is one that starts 
when no other bus cycles are outstanding. 



3.3.2 SHIFT MODE 

When SCAN is asserted, the shift mode is selected. 
In shift mode, the pins are organized into a boundary 
scan chain. The scan chain is configured as a shift 
register that is shifted on the rising edge of CLK. The 
SHI pin is connected to the input of one end of the 
boundary scan chain. The value of the most signifi- 
cant bit of the scan chain is output on the BREQ pin. 
To avoid glitches while the values are being shifted 
along the chain, all tristate outputs are disabled. The 
order of the pins within the chain is shown in Figure 
3.1. 



4.1 Pipelining 

A m-n read or write cycle is a cycle with a total cycle 
time of m clocks and a cycle-to-cycle time of n 
clocks (m > n). Total cycle time extends from the 
clock in which ADS# is activated to the clock in 
which READY # becomes active; whereas, cycle-to- 
cycle time extends from the time that READY # is 
sampled active for the previous cycle to the time 
that it is sampled active again for the current cycle. 
When m = n, a nonpipelined cycle is implied; m > n 
implies a pipelined cycle. 
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Figure 3.1. Order of Boundary Scan Chain 
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Pipelining may occur for the next bus cycle any time 
the current bus cycle requires more than two clock 
periods to finish (m > 2). The next cycle can be 
initiated when NA# is sampled active, even if the 
current cycle has not terminated. In this case, pipe- 
lining occurs. NA# is recognized only in the clock 
when ADS# has become inactive. 

To allow high transfer rates in large memory sys- 
tems, two-level pipelining is supported (i.e there may 
be up to three cycles in progress at one time). Pipe- 
lining enables a new word of data to be transferred 
every two clocks, even though the total cycle time 
may be up to six clocks. 



4.2 Bus State Machine 

The operation of the bus is described in terms of a 
bus state machine using a state transition diagram. 
Figure 4.1 illustrates the 860 microprocessor bus 
state machine. A bus cycle is composed of two or 
more states. Each bus state lasts for one CLK peri- 
od. 

The 860 microprocessor supports up to two levels of 
address pipelining. Once it has started the first bus 
cycle, it can generate up to two more cycles as long 
as READY # remains Inactive. To start a new bus 
cycle while other cycles are still outstanding, NA# 
must be active for at least one clock cycle starting 
with the clock after the previous ADS#. NA# is 
latched internally. 

States Tj and Tjk, for j = ! 1 ,2,3 S and k = j 1 ,2 ) , are 
used to describe the state of the 860 microproces- 
sor Bus State Machine. Index j indicates the number 
of outstanding bus cycles while index k distinguishes 
the intermediate states for the j-th outstanding cycle. 



Therefore there can be up to three outstanding cy- 
cles, and there are two possible intermediate states 
for each level of pipelining. Tji is the next state after 
Tj, as long as j cycles are outstanding. Tj2 is entered 
when NA# is active but the 860 microprocessor is 
not ready to start a new cycle. 

Five conditions have to be met to start a new cycle 
while one or more cycles are already pending: 

1. READY # inactive 

2. NA# having been active 

3. An internal request pending 

4. HOLD not active (or HOLD active, but not being 
serviced because LOCK# is active) 

5. Fewer than three cycles outstanding 

Upon hardware RESET, the bus control logic enters 
the idle state T| and awaits an internal request for a 
bus cycle. If a bus cycle is requested while there is 
no hold request from the system, a bus cycle begins, 
advancing to state T-| . On the next cycle, the state 
machine automatically advances to state T-|i. If 
READY # is active in state Tii, the bus control logic 
returns either to T|, if no new cycle is started, or to 
Ti, if a new cycle request is pending internally. In 
fact, if an internal bus request is pending each time 
READY # is active, the state machine continues to 
cycle between T-i i and Ti . 

However, if READY # is not active but the next ad- 
dress request is pending (as indicated by an active 
NA#), the state machine advances either to state 
T2 (if an internal bus request is pending, signifying 
that two bus cycles are now outstanding), or to state 
T-12 (if no bus internal request is pending, signifying 
NA# has been found active). Transitions from state 
T12 are similar to those from T-ji. 
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READY# DEASSERTED 



READY# DEASSERTED- 
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(NO REQUEST ■^ 
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HOLD DEASSERTED- 
NO REQUEST 




Assumed deasserted one clock after the ac- 
tive clock 

Not sampled during ADS# active clock 
Active in Ti , T2 and T3 
Active in Th 

Synchronized internally and masked by bus 
lock request 
Internal Bus Request Pending 



HOLD ASSERTED 
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Figure 4.1. Bus State Machine 



If two bus cycles are already outstanding (as indicat- 
ed by T2k for k = {1,2}) and NA# is latched active 
but READY # is not active, one more bus request 
causes entry into state T3. Transitions from this 
state are similar to those from T2. 

In general, if there is an internal bus request each 
time both READY # and NA# are active, the state 



machine continues to oscillate between Tji and T, 
forj = {2,3). 



When NA# is sampled active while there is a pend- 
ing bus request, ADS# is activated in the next clock 
period (provided no more than two cycles are al- 
ready outstanding). 
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Internal pending bus requests start new bus cycles 
only if no HOLD request has been recognized. Th is 
entered from the idle state T| only. HLDA is active In 
this state. There is a one clock delay to synchronize 
the HOLD input when the signal meets the respec- 
tive minimum setup and hold time requirements. The 
state machine uses the synchronized HOLD to move 
from state to state. 



4.3 Bus Cycles 

Figures 4.2 through 4.10 illustrate combinations of 
bus cycles. 



4.3.1 NONPIPELINED READ CYCLES 

A read cycle begins with the clock in which ADS# is 
asserted. The 860 microprocessor begins driving the 
address during this clock. It samples READY # for 
active state every clock after the first clock. A mini- 
mum of two clocks is required per cycle. Data is 
latched when READY # is found active when sam- 
pled at the end of a clock period. Figure 4.2 illus- 
trates nonpipelined read cycles with zero wait 
states. 

Normally, all 64 bits of the data bus are latched; 
however, in the case of noncacheable bus cycles, 
the byte enables BE7#-BE0# determine which 
bytes are used. In CSS mode, only the low-order 
eight bits are latched. 



CLK 



ADS# 




^ 



A31-A3, W/R#, 

BEn#, NENE#, 

LOCK#, PTB 



NA# 



READY# 



D63-D0 



CYCLE 1 

NON-PIPELINED 
READ 
(2-2) 



S2S 



mm 



mm 



>■ 



M 



•<:> 



CYCLE 2 

NON-PIPELINED 
READ 
(2-2) 



^ 



S2X 



mm 



MIA 



M 



•o 



CYCLE 3 

NON-PIPELINED 
READ 
(2-2) 



m 



S2X 



mim 



Milk 



M 



■c 



240296-13 



Figure 4.2. Fastest Read Cycles 
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Figure 4.3. Fastest Write Cycles 



4.3.2 NONPIPELINED WRITE CYCLES 

The ADS# and READY# activity for write cycles 
follows the same logic as that for read cycles, as 
Figure 4.3 illustrates for back-to-back, nonpipelined 
write cycles with zero wait-states. The byte enables 
BE7#-BE0# indicate which bytes on the data bus 
are valid. 

The fastest write cycle takes only two clocks to com- 
plete. However, when a read cycle immediately pre- 



cedes a write cycle, the write cycle must contain a 
wait state, as illustrated in Figure 4.4. Because the 
device being read might still be driving the data bus 
during the first clock of the write cycle, there is a 
potential for bus contention. To help avoid such con- 
tention, the 860 microprocessor does not drive the 
data bus until the second clock of the write cycle. 
The wait state is required to provide the additional 
time necessary to terminate the write cycle. In other 
read-write combinations, the 860 microprocessor 
does not require a wait state. 
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Figure 4.4. Fastest Read/Write Cycles 
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Figure 4.5. Pipelined Read Followed by Pipelined Write 
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Figure 4.6. Pipelined Write Followed by Pipelined Read 



4.3.3 PIPELINED READ AND WRITE CYCLES 

Figures 4.5 and 4.6 illustrate combinations of non- 
pipelined and pipelined read and write cycles. The 
following description applies to both diagrams. While 
Cycle 1 is still in progress, two new cycles are initiat- 
ed. By the time READY # first becomes active, the 
state machine has moved through states T-i, T^, 
T2. T21 , and T3. Cycles 3 and 4 show how activating 
READY # terminates an outstanding cycle (Cycle 3 
in this case), and yet activating NA# while there is 
an internal request pending adds a new outstanding 
cycle. 

In Figure 4.5, Cycle 3 is a write cycle following a read 
cycle; therefore, one wait state must be inserted. 
The 860 microprocessor does not drive the data bus 
until one clock after the read data is returned from 
the preceding read cycle. During Cycles 3 and 4, the 
state machine oscillates between states T3 and T31 



maintaining full bus capacity (two levels of pipelin- 
ing; three outstanding cycles). Cycles 2, 3, and 4 in 
Figure 4.6 are 5-2 cycles; i.e. each requires a total 
cycle time of five clocks while the throughput rate is 
one cycle every two clocks. 

Figure 4.7 illustrates in a more general manner how 
the NA# signal controls pipelining. Cycle 1 is a 2-2 
cycle, the fastest possible. The next cycle cannot be 
started any earlier; therefore, there is no need to 
activate NA# to start the next cycle early. Cycle 2, a 
3-3 read, is different. Cycle 3 can be started during 
the third state (a wait state) of Cycle 2, and NA# is 
asserted to accomplish this. 

NA# is not activated following the ADS# clock of 
Cycle 3, thereby allowing Cycle 3 to terminate be- 
fore the start of Cycle 4. As a result, Cycle 4 is a 
nonpipelined cycle. 
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Figure 4.7. Pipelining Driven by NA# 
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Figure 4.8. NA# Active with No Internal Bus Request 
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Figure 4.9. Locked Cycles 



When there is no internal bus request, activating 
NA# does not start a new cycle; the 860 microproc- 
essor, however, remembers that NA# has been ac- 
tivated. Figure 4.8 illustrates the situation where 
NA# is active but no internal bus request is pending. 
NA# is activated when two cycles are outstanding. 
Because there is no internal request pending until 
after one idle state, no new bus cycle is started dur- 
ing that period. 

4.3.4 LOCKED CYCLES 

The LOCK# signal is asserted when the current bus 
cycle is to be locked with the next bus cycle. Asser- 
tion of LOCK# may be initiated by a program's set- 
ting the BL bit of tne dirbase register (refer to sec- 
tion 2) or by the 860 microprocessor itself during 
page table updates. 

In Figure 4.9, the first read cycle is to be locked with 
the following write cycle. If there were idle states 
between the cycles, the LOCK# signal would re- 
main asserted. This is the case for a read/modify/ 
write operation. HOLD is not acknowledged until all 
locked cycles are finished. The second write cycle is 
hot locked because LOCK# is no longer asserted 
when the first write cycle starts. 



4.3.5 HOLD AND BREQ ARBITRATION CYCLES 

The HOLD, HLDA, and BREQ signals permit bus ar- 
bitration between the 860 microprocessor and an- 
other bus master. 

As Figure 4.10 illustrates, the Th (hold) state can be 
entered only from the idle state T|. When HOLD is 
asserted, the 860 microprocessor keeps control of 
the bus until all outstanding cycles, including locked 
cycles, are completed. 

If HOLD were asserted one clock earlier, the last 
860 microprocessor bus cycle before HLDA would 
not be started. LOCK# is assumed to be inactive in 
the last bus cycle. If LOCK# were active the 860 
microprocessor would not give up the bus. 

The outputs (except HLDA and BREQ) float when 
HLDA is asserted. HOLD is sampled at the end of 
the clock in which it is activated. Recommended set- 
up and hold times must be met to guarantee sam- 
pling one clock after external HOLD activation. 
When HOLD is sampled active, a one clock delay for 
internal synchronization follows. HLDA may be 
deasserted as early as the clock following deasser- 
tion of HOLD. 
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Figure 4.10. HOLD, HLDA, and BREQ 



If, during a HOLD cycle, an internal bus request is 
generated, BREQ is activated even though HLDA is 
asserted. It remains active at least until the clock 
after ADS# is activated for the requested cycle. 



4.4 Bus States During RESET 

Figure 4.11 shows how INT/CS8 is sampled during 
the clock period just before the falling edge of RE- 



SET. If INT/CS8 is sampled active, the 860 micro- 
processor enters CS8 mode. No inputs (except for 
HOLD, INT/CS8, and GC1 -CCO) are sampled dur- 
ing RESET. 

Note that, because HOLD is recognized even while 
RESET is active, the HLDA output signal may also 
become active during RESET. Refer to Figure "Out- 
put Pin Status during Reset" in section 3. 
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Figure 4.11. Reset Activities 
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5.0 MECHANICAL DATA 

Figures 5.1 and 5.2 show the locations of pins; Tables 5.1 and 5.2 help to locate pin Identifiers. 
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() 

A9 


() 

All 


() 

A14 


() 

A16 


() 

CLK 


() 

A22 


() 

A26 


() 

A30 


() 

CGI 


() 

062 


() 

060 


() 

Vss 


() 

Vcc 


3 


4 


() 

Vss 


() 

Vcc 


() 

A5 
























() 

063 


() 

059 


() 

Vss 


4 


5 


() 

Vcc 


() 

A4 


() 

A3 
























() 

061 


() 

058 


() 

056 


5 


6 


(} 

W/R# 


() 

NENE# 


() 

PTB 
























() 

057 


() 

054 


() 

052 


6 


7 


() 

ADS# 


() 

HLDA 


() 

BREQ 
























() 

055 


() 

053 


() 

050 


7 


8 


() 

LOCK# 


() 

KEN# 


() 

READY# 
























() 

051 


() 

049 


() 

048 


8 


9 


() 

INT/CS8 


() 

NA# 


() 

HOLD 
























() 

047 


() 

045 


() 

046 


9 


10 


() 

BE5# 


() 

BE7# 


() 

BE6# 
























() 

043 


() 

042 


() 

044 


10 


11 


() 

BE3# 


() 

BE2# 


() 

BE4# 
























() 

039 


() 

041 


() 

040 


11 


12 


() 

SHI 


() 

BE1# 


() 

BEO# 
























() 

037 


() 

036 


() 

038 


12 


13 


() 

RESET 


() 

SCAN 


() 

BSCN 
























() 

035 


() 

034 


() 

Vcc 


13 


14 


() 

Vss 


() 

DO 


() 

D1 
























() 

033 


() 

Vcc 


() 

Vss 


14 


15 


() 

Vcc 


() 

Vss 


() 

02 


() 

D3 


() 

D5 


() 

07 


() 

Oil 


() 

013 


() 

017 


() 

021 


() 

023 


() 

027 


() 

029 


() 

031 


() 

032 


() 

Vss 


() 

Vcc 


15 


16 


() 

Vss 


() 

Vcc 


() 

Vss 


() 

Vcc 


() 

04 


() 

09 


() 

08 


() 

015 


() 

014 


() 

019 


() 

022 


() 

025 


() 

028 


() 

030 


() 

Vss 


() 

Vcc 


() 

Vss 


16 


17 


() 

Vcc 


() 

Vss 


() 

Vcc 


() 

Vss 


() 

Vcc 


() 

06 


() 

010 


() 

012 


C) 

016 


() 

018 


() 

020 


() 

024 


() 

026 


() 

Vss 


() 

Vcc 


() 

Vss 


() 

Vcc 


17 


s 


R 


Q 


p 


N 


M 


L 


K 


J 


H 


G 


F 


E 





c 


B 


A 
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Figure 5.1. Pin Configuration-^ View from Top Side 
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1 


A 


B 


c 


D 


E 


F 


G 


H J K 


L 


M 


N 


P 


Q 


R 


s 


1 


o 

Vcc 


o 

Vss 


o 

Vcc 


o 

Vss 


o 

Vcc 


o 

A31 


o 

A29 


o o o 

A25 A23 A21 


o 

A19 


o 

A17 


o 

A12 


o 

Vss 


o 

Vcc 


o 

Vss 


o 

Vcc 


2 


o 

Vss 


o 

Vcc 


o 

Vss 


o 

Vcc 


o 

ceo 


o 

A28 


o 

A27 


o o o 

A24 A20 A18 


o 

A15 


o 

A13 


o 

A10 


o 

AS 


o 

Vss 


o 

Vcc 


o 

Vss 


2 


3 
4 
5 


o 

Vcc 

o 

Vss 

o 

D56 


o 

Vss 

o 

D59 

o 

D58 


o 

D60 

o 

D63 

o 

D61 


o 

D62 


o 

CC1 


o 

A30 


o 

A26 


o o o 

A22 CLK A1 6 


o 

A14 


o 

All 


o 

A9 


o 

A7 


o 

A6 

o 

A5 

o 

A3 


o 

Vss 

o 

Vcc 

o 

A4 


o 

Vcc 

o 

Vss 

o 

Vcc 


3 

4 
5 










METAL LID 










6 


o 

D52 


o 

D54 


o 

D57 




















o 

PTB 


o 

NENE# 


o 

W/R# 


6 


7 


o 

D50 


o 

D53 


o 

D55 




















o 

BREQ 


o 

HLOA 


o 

ADS# 


7 


8 


o 

D48 


o 

D49 


o 

D51 




















o 

REAOY# 


o 

KEN# 


o 

LOCK# 


8 


9 


o 

D46 


o 

D45 


o 

D47 




















o 

HOLO 


o 

NA# 


o 

INT/CS8 


9 


10 


o 

D44 


o 

D42 


o 

D43 




















o 

BE6# 


o 

BE7# 


o 

BE5# 


10 


11 


o 

D40 


o 

D41 


o 

D39 




















o 

BE4# 


o 

BE2# 


o 

BE3# 


11 


12 


o 

D38 


o 

D36 


o 

D37 




















o 

BEO# 


o 

BE1# 


o 

SHI 


12 


13 


o 

Vcc 


o 

D34 


o 

D35 




















o 

BSCN 


o 

SCAN 


o 

RESET 


13 


U 


o 

Vss 


o 

Vcc 


o 

D33 




















o 

01 


o 

00 


o 

Vss 


14 


15 


o 

Vcc 


o 

Vss 


o 

D32 


o 

D31 


o 

D29 


o 

027 


o 

023 


O O O 

021 017 013 


o 

oil 


o 

07 


o 

05 


o 

03 


o 

02 


o 

Vss 


o 

Vcc 


15 


16 


o 

Vss 


o 

Vcc 


o 

Vss 


o 

D30 


o 

D28 


o 

025 


o 

022 


o o o 

019 014 015 


o 

08 


o 

09 


o 

04 


o 

Vcc 


o 

Vss 


o 

Vcc 


o 

Vss 


16 


17 


o 

Vcc 


o 

Vss 




Vcc 


o 

Vss 


o 

026 


o 

024 


o 

020 


o o o 

018 016 012 


o 

010 


o 

06 


o 

Vcc 


o 

Vss 


o 

Vcc 


o 

Vss 


o 

Vcc 


17 


A 


B 


c 


D 


E 


F 


G 


H J K 


L 


M 


N 


p 


Q 


R 


s 
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Figure 5.2. Pin Configuration— View from Pin Side 
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Table 5.1. Pin Cross Reference by Location 



Location Signal 

A1 Vcc 

A2 Vss 

A3 Vcc 

A4 Vss 

A5 D56 

A6 D52 

A7 D50 

A8 D48 

A9 D46 

A10 D44 

A11 D40 

A12 D38 

A13 Vcc 

A14 Vss 

A15 Vcc 

A16 Vss 

A17 Vcc 

B1 Vss 

B2 Vcc 

B3 ...Vss 

84 D59 

B5 D58 

B6 D54 

87 D53 

88 D49 

89 D45 

810 D42 

811 D41 

812 D36 

813 D34 

814 Vcc 

815 Vss 

816 Vcc 

817 Vss 

01 Vcc 

02 Vss 

03 D60 

04 D63 

05 D61 

06 D57 

07 D55 

08 D51 



Location 



Signal 



Location 



Signal 



Location Signal 

Q10 8E6# 

Q11 8E4# 

Q12 8E0# 

Q13 8S0N 

Q14 D1 

Q15 D2 

Q16 Vss 

Q17 Vcc 

R1 Vss 

R2 Vcc 

R3 Vss 

R4 Vcc 

R5 A4 

R6 NENE# 

R7 HLDA 

R8 KEN# 

R9 NA# 

RIO BE7# 

R11 8E2# 

R12 8E1# 

R13 SOAN 

R14 DO 

R15..... Vss 

R16 Vcc 

R17 Vss 

31 Vcc 

32 Vss 

33 Vcc 

34 Vss 

35 Vcc 

36 W/R# 

37 AD3# 

38 LOOK# 

39 INT/C38 

310 8E5# 

311 8E3# 

312 SHI 

313 RESET 

314 Vss 

315 Vcc 

316 Vss 

317 Vcc 



09.. 

010. 

Oil. 

012. 

013. 

014. 

015. 

016. 

017. 

D1 .. 

D2.. 

D3.. 

D15. 

D16. 

D17. 

El .. 

E2.. 

E3.. 

E15. 

E16. 

E17. 

F1 .. 

F2.. 

F3.. 

F15. 

F16. 

F17. 

G1.. 

G2.. 

G3.. 

G15 

G16 

G17 

HI .. 

H2.. 

H3.. 

H15. 

H16. 

H17. 

J1 .. 

J2 .. 

J3.. 



.D47 
.D43 
.D39 
.D37 
.D35 
.D33 
.D32 
.Vss 
.Vcc 
.Vss 

.Vcc 

.D62 
.D31 
.D30 

.Vss 

.Vcc 
.000 
.001 
.D29 
.D28 
.D26 
.A31 
.A28 
.A30 
.D27 
.D25 
.D24 
.A29 
.A27 
.A26 
.D23 
.D22 
.D20 
.A25 
.A24 
.A22 
.D21 
.D19 
.D18 
.A23 
.A20 
.OLK 



D17 

D14 

D16 

A21 

A18 

A16 

D13 

D15 

D12 

A19 

A15 

A14 

D11 

D8 

D10 

A17 

A13 

......All 

D7 

D9 

D6 

A12 

A10 

A9 

D5 

D4 

Vcc 

Vss 

A8 

A7 

D3 

Vcc 

Vss 

Vcc 

Vss 

A6 

A5 

A3 

PT8 

BREQ 

Q8 READY# 

Q9 HOLD 



J15 . 

J16. 

J17. 

K1 .. 

K2.. 

K3.. 

K15. 

K16. 

K17. 

LI .. 

L2 .. 

L3 .. 

L15. 

L16. 

L17. 

Ml.. 

M2.. 

M3.. 

M15 

M16 

M17 

N1 .. 

N2.. 

N3.. 

N15. 

N16. 

N17. 

PI .. 

P2 .. 

P3 .. 

P15. 

P16. 

P17. 

Q1.. 

Q2.. 

Q3.. 

Q4.. 

Q5.. 

Q6 . 

07., 
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Table 5.2. Pin Cross Reference by Pin Name 






Signal 


Location 


Signal Location 


Signal Location 


Signal 


Location 


A3 


Q5 


CLK J3 


D41 ....B11 


Vcc 


B16 


A4 


R5 


DO R14 


D42 B10 


Vcc.... 


01 


A5 


Q4 


D1 Q14 


D43 C10 


Vcc 


017 


A6 


Q3 


D2 Q15 


D44 A10 


Vcc 


D2 


A7 


P3 


D3 P15 


D45 B9 


Vcc.... 


El 


A8 


P2 


D4 N16 


D46 A9 


Vcc 


N17 


A9 


N3 


D5 N15 


D47... C9 


Vcc 


P16 


A10 


N2 


D6 M17 


D48 A8 


Vcc 


01 


A11 


M3 


D7 M15 


D49 B8 


Vcc .... 


017 


A12 


N1 


DB L16 


D50 A7 


Vcc 


R2 


A13 


M2 


D9 M16 


D51 C8 


Vcc 


R4 


A14 .... 


L3 


D10 L17 


D52 A6 


Vcc 


R16 


A15 .... 


L2 


D11 L15 


D53 B7 


Vcc.... 


SI 


A16.... 


K3 


D12 K17 


D54 B6 


Vcc.... 


S3 


A17 


M1 


D13 K15 


D55 C7 


Vcc ... . 


S5 


A18.... 


K2 


D14 J16 


D56 A5 


Vcc 


.......S15 


A19 .... 


L1 


D15 K16 


D57 .C6 


Vcc 


S17 


A20 . . . . 


J2 


D16 J17 


D58 B5 


Vss . . . . 


A2 


A21 ..... 
A22 


K1 

H3 


D17 J15 


D59 B4 

D60 03 


Vss 

Vss 


A4 

A14 


D18 H17 


A23 . . . . 


J1 


D19 H16 


D61 C5 


Vss 


A16 


A24 


H2 


D20 G17 


D62 D3 


Vss ... . 


B1 


A25 


H1 


D21 H15 


D63 C4 


Vss 


B3 


A26 


G3 


D22 G16 


HLDA R7 


Vss 


B15 


A27 


G2 


D23 G15 


HOLD 09 


Vss 


B17 


A28 . . . . 


F2 


D24 F17 


INT/CS8 S9 


Vss 


02 


A29 


G1 


D25 F16 


KEN# R8 


Vss 


016 


A30 . . . . 


F3 


D26 E17 


LOCK# S8 


Vss 


D1 


A31 


F1 


D27 F15 


NA# R9 


Vss 


D17 


ADS#... 


S7 


D28.. E16 


NENE# R6 


Vss.... 


PI 


BEO#... 


Q12 


D29 E15 


PTB 06 


Vss 


P17 


BE1 # . . . 


R12 


D30 D16 


READY# 08 


Vss 


02 


BE2#... 


R11 


D31 D15 


RESET 813 


Vss 


016 


BE3# ... 


S11 


D32 C15 


SCAN R13 


Vss 


R1 


BE4#... 


Q11 


D33 C14 


SHI S12 


Vss 


R3 


BE5# ... 


S10 


D34 B13 


VCC A1 


Vss 


R15 


BE6#... 


Q10 


D35 C13 


VCC A3 


Vss 


R17 


BE7#... 


R10 


D36 B12 


Vcc A13 


Vss ... . 


S2 


BREQ... 


Q7 


D37 C12 


Vcc A15 


Vss.... 


S4 


BSCN... 


Q13 


D38 A12 


Vcc A17 


Vss..... 


S14 


ceo 


E2 


D39 C11 


Vcc B2 


Vss 


S16 


CC1 


E3 


D40 ....A11 


Vcc B14 


W/R# . 


S6 
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Table 5.3. Ceramic PGA Package Dimension Symbols 



Letter or 
Symbol 


Description of Dimensions 


A 


Distance from seating plane to highest point of body 


Ai 


Distance between seating plane and base plane (lid) 


A2 


Distance from base plane to highest point of body 


A3 


Distance from seating plane to bottom of body 


B 


Diameter of terminal lead pin 


D 


Largest overall package dimension of length 


Di 


A body length dimension, outer lead center to outer lead center 


ei 


Linear spacing between true lead position centerlines 


L 


Distance from seating plane to end of lead 


Si 


Other body dimension, outer lead center to edge of body 



NOTES: 

1 . Controlling dimension: millimeter. 

2. Dimension "ei" ("e") is non-cumulative. 

3. Seating plane (standoff) is defined by P.C. board hole size: 0.0415-0.0430 inch. 

4. Dimensions "B", "Bi" and "C" are nominal. 

5. Details of Pin 1 identifier are optional. 
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D 




"*1 






r — ^ Di — H 




— 


01.65 


®®®®®®®®®®®®®®®®® 




L_ 


®@®®®®@®®®®®®®®®® 




t®®©®®®®®®®®®®®©®® 




r~ 








® ® ® 




® ® ® 




® ® ® 




® ® ® 






® ® ® 


^ ^ 


® ® ® 






® ® ® 




® ® ® 






® ® ® 


^ \ 


® ® ® 






® ® @ 


® ® ® 


D 




® ® ® 




® ® ® 




PIN C3^ 


® ® ® 


V y 


® ® ® 






^ ® ® 


® ® ® 






V® ® 




® ® ® 






® @r® 




® ® ® 






@ @B 


®®®®®®®®®@® 


©® ® 




®®®®®®®@®®®@®® ©"No) ® 




\ 


®®®®®®®®®®®®®®® (i\® 





T 

2.29 
1.52 
450 CHAMFER 
(INDEX CORNER) 



SEATING_ 
PLANE 






rREF. 



SWAGGED- 

PIN 

(4 PL) 



BASE_ 
PLANE 



SEATING 
PLANE"* 
0B (ALL PINS) 



«] 



SWAGGED 

PIN 

DETAIL 



240296-30 



Family: Ceramic Pin Grid Array Pacl(age 


Symbol 


Millimeters 


Inches 


Min 


Max 


Notes 


Min 


Max 


Notes 


A 


3.56 


4.57 




0.140 


0.180 




Ai 


0.64 


1.14 


SOLID LID 


0.025 


0.045 


SOLID LID 


A2 


23 


0.30 


SOLID LID 


0.110 


0.140 


SOLID LID 


A3 


1.14 


1.40 




0.045 


0.055 




B 


0.43 


0.51 




0.017 


0.020 




D 


44.07 


44.83 




1.735 


1.765 




Di 


40.51 


40.77 




1.595 


1.605 




©1 


2.29 


2.79 




0.090 


0.110 




L 


2.54 


3.30 




0.100 


0.130 




N 


168 




168 




Si 


1.52 


2.54 




0.060 


0.100 




ISSUE 


IWS REVX 7/15/88 





Figure 5.3. 168 Lead Ceramic PGA Package Dimensions 
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6.0 PACKAGE THERMAL 
SPECIFICATIONS 

The 860 microprocessor is specified for operation 
when Tc (the case temperature) Is within the range 
of 0°C~85°C. Tc may be measured in any environ- 
ment to determine whether the 860 microprocessor 
is within specified operating range. The case tem- 
perature should be measured at the center of the 
top surface opposite the pins. 

Ta (the ambient temperature) can be calculated 
from ^CA (thermal resistance from case to ambient) 
with the following equation: 

Ta = Tc - P*0CA 

Typical values for ^CA at various airflows are given 
in Table 6.1 for the 1.75 sq. In., 168 pin, ceramic 
PGA. 

Table 6.2 shows the maximum Ta allowable (without 
exceeding Tc) at various airflows and operating fre- 
quencies (fcLK)- 



Note that Ta is greatly improved by attaching "fins" 
or a "heat sink" to the package. P (the maximum 
power consumption) is calculated by using the maxi- 
mum Ice at 5V as tabulated in the DC Characteris- 
tics of section 7. 

Table 6.1. Thermal Resistance (Oqa) at Various 
Airflows 







ln° 


C/Watt 








Airf low-ft/min (m/sec) 





(0) 


200 
(1.01) 


400 
(2.03) 


600 
(3.04) 


800 
(4.06) 


1000 
(5.07) 


(9cAwith 
Heat Sink* 


13 


9 


5.5 


5.0 


3.9 


3.4 


^CA without 
Heat Sink 


17 


14 


11 


9 


7.1 


6.6 



*0.285" high unidirectional heat sink (Al alloy 6061, 50 mil 
fin width, 150 mil center-to-center fin spacing). 



Table 6.2. Maximum Ta at Various Airflows 
ln°C 







Airflow-ft/min (m/sec) 




fCLK 
(l\/IHz) 



(0) 


200 
(1.01) 


400 
(2.03) 


600 
(3.04) 


800 
(4.06) 


1000 
(5.07) 


Ta with 
Heat Sink* 


33.3 


46 


58 


69 


70 


73 


75 


40.0 


43 


56 


67 


69 


72 


74 


Ta without 
Heat Sink 


33.3 


34 


43 


52 


58 


64 


65 


40.0 


30 


40 


49 


56 


62 


64 



*0.285" high unidirectional heat sink (Al alloy 6061, 50 mil fin width, 150 mil 
center-to-center fin spacing). 
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7.0 ELECTRICAL DATA 

Inputs and outputs are TTL compatible. All input and 
output timings are specified relative to the 1.5 volt 
level of the rising edge of CLK and refer to the point 
that the signal reaches 1 .5V. 



7.1 Absolute Maximum Ratings 

Case Temperature Tc under Bias 0°C to 85°C 

Storage Temperature -65°C to + 1 50°C 

Voltage on Any Pin 

with Respect to Ground - 0.5 to Vcc + 0-5V 



* Notice: Stresses above those listed under "Abso- 
lute Maximum Ratings" may cause permanent dam- 
age to the device. This is a stress rating only and 
functional operation of the device at these or any 
other conditions above those indicated in the opera- 
tional sections of this specification is not implied. Ex- 
posure to absolute maximum rating conditions for 
extended periods may affect device reliability. 

NOTICE: Specifications contained within the 
following tables are subject to change. 



7.2 D.C. Characteristics 



Table 7.1. DC Characteristics 



Symbol 


Parameter 


Min 


Max 


Units 


Notes 


V|L 


Input LOW Voltage 


-0.3 


+ 0.8 


V 




V|H 


Input HIGH Voltage 


2.0 


Vcc + 0.3 


V 




Vol 


Output LOW Voltage 




0.45 


V 


(Notel) 


VOH 


Output HIGH Voltage 


2.4 




V 


(Note 2) 


Ice 


Power Supply Current 












CLK = 33.3 MHz 




600 


mA 


Vcc @5V 




CLK = 40.0 MHz 




650 


mA 


Vcc @5V 


III 


Input Leakage Current 




±15 


^lA 


No pullup 
or pulldown 


Ilo 


Output Leakage Current 




±15 


l,k 




C|N 


Input Capacitance 




15 


PF 




Co 


I/O or Output Capacitance 




15 


PF 




CcLK 


Clock Capacitance 




20 


pF 





NOTES: 

1 . This parameter is measured at 4.0 mA for address, data, and byte enables; at 5.0 mA for definition and control. 

2. This parameter is measured at 1 .0 mA for address, data, and byte enables; at 0.9 mA for definition and control. 
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7.3 A.C. Characteristics 





Table 7.2. A.C. Characteristics Tq = 


= to 85°C, Vcc 


= 5V± 


5% 






All timings measured at 1 .5V unless othen/vise specified. 






Symbol 


Parameter 


33.3 MHz 


40.0 MHz 


Test 
Conditions 


Min 


Max 


Min 


Max 






(ns) 


(ns) 


(ns) 


(ns) 




t1 


CLK period 


30 


125 


25 


125 




t2 


CLK high time 


7 




5 




at2V 


t3 


CLK low time 


7 




5 




at.SV 


t4 


CLK fall time 




4 




4 




t5 


CLK rise time 




4 




4 




tea 


A31-A3, PTB, W/R#, NENE# valid 


3.5 


38 


3.5 


29 


50 pF load 


t6b 


BEn# valid 


3.5 


41 


3.5 


32 


50 pF load 


t7 


Float time, all outputs 


3.5 


30 


3.5 


25 


(Notel) 


t8 


ADS#. BREQ, LOCK#, HLDA valid delay 


3.5 


26 


3.5 


21 


50 pF load 


t9 


D63-D0 valid delay 


3.5 


47 


3.5 


43 


50 pF load 


t10 


Setup time, all inputs except INT, HOLD 


13 




10 






t11 


Hold time, all inputs except INT, HOLD 


4 




4 






t12 


INT, HOLD setup time 


22 




15 




(Note 2) 


t13 


INT, HOLD hold time 


5 




5 




(Note 2) 



NOTES: 

1. Float condition occurs when maximum output current becomes less than Ilo in magnitude. Float delay is not tested. 

2. INT and HOLD are asynchronous inputs. The setup and hold specifications are given for test purposes or to assure 
recognition on a specific rising edge of CLK. 
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CLK 




INPUT 
SETUP 

t12n,in 

^ >■ 



INPUTS 



-( 



t10„ 



OUTPUTS 



OUTPUTS 



t2 




INPUT 
HOLD 



t13„ 



t11. 



)- 



^^moxA^maxA^ma: 



k- ^6^InA8minA9, 



:ssssi 



VALID 



t7„ 



))))))))r " 



t3 




Figure 7.1. CLK, Input, and Output Timings 
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nom +15 



nom +10 



TYPICAL* OUTPUT 

DELAY (ns) "O"^ +5 
@ 1.5V 



nom -5 




D53-DO 



V/R#. 



NENE# 
LqCK#, HLDA 



25 50 75 100 125 150 

LOAD CAPACITANCE, Cl (pf) 



240296-26 



NOTES: 

Graphs are not linear outside the Cl range shown, 
nom = nominal value given in the AC timing table. 
*Typical part under worst-case conditions. 



Figure 7.2. Typical Output Delay vs Load Capacitance under Worst-Case Conditions 



18 
15 

12 

TYPICAL* OUTPUT 
SLEW TIME (ns) 9 
(0.8 -2.0V) 

6 




1-A3, 



BE7#-BE3# 



A )S#, BREQ, LOCK#, HLDA 



PTB, V /R#, NENE# 



25 50 75 100 125 150 
LOAD CAPACITANCE, Cl (pf) 



NOTES: 

Graphs are not linear outside the Cl range shown. 
*Typical part under worst-case conditions. 



Figure 7.3. Typical Slew Time vs Load Capacitance under Worst-Case Conditions 
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NOTES: 

Graphs are nc 
shown. 
*Worst-case su 
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FREQUENCY (MHz) 

240296-28 

t linear outside the frequency range 
pply current at 5V. 



Figure 7.4. Typical Ice vs Frequency 

8.0 INSTRUCTION SET 

Key to abbreviations: 

srd A register (integer or floating-point 

depending on class of instruction) 
or a 16-bit Immediate value. The 
immediate value is sign-extended 
for add and subtract operations 
and zero-extended for logical oper- 
ations. 

srdni Same as srd except that no im- 

mediate value is permitted. 

src2 A register (integer or floating-point 

depending on class of instruction). 

rdest A register (integer or floating-point 

depending on class of instruction). 

freg A floating-point register. 

ireg An integer register. 

ctrlreg One of the control registers fir, 

psr, dirbase, db, or fsr. 

# const A 16-bit immediate address offset 

that the 860 microprocessor sign- 
extends to 32 bits when computing 
the effective address. 



mem.x(address) The contents of the memory loca- 
tion indicated by address with a 
size of X. 

.p Precision specification. Unless oth- 

erwise specified, floating-point op- 
erations accept single- or double- 
precision source operands and 
produce a result of equal or great- 
er precision. Both input operands 
must have the same precision. The 
source and result precision are 
specified by a two-letter suffix to 
the mnemonic of the operation, as 
shown in the table below. 



.w 

.X 



.z 
Ibroff 

sbroff 

brx 



srds 



PM 



Suffix 


Source 
Precision 


Result 
Precision 


.ss 
.sd 
.dd 


single 
single 
double 


single 

double 

double 



.ss (32 bits), or .dd (64 bits) 

.b (8 bits), .s (16 bits), or .11 (32 
bits) 

.1 (32 bits), .d (64 bits), or .q (128 
bits) 

.1 (32 bits, or .d (64 bits) 

A signed, 26-bit, immediate, rela- 
tive branch offset 

A signed, 16-bit, immediate, relat- 
vie branch offset 

A function that computes the tar- 
get address of a branch by shifting 
the offset (either Ibroff or sbroff) 
left by two bits, sign-extending it to 
32 bits, and adding the result to 
the address of the current control- 
transfer instruction plus four. 

An integer register or a 5-bit imme- 
diate that is zero-extended to 32 
bits. 

The pixel mask, which is consid- 
ered as an array of eight bits 
PM[0]..PM[7], where PM[0] is the 
least significant bit. 
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8.1 Instruction Definitions in Alpliabetical Order 

adds srcl, src2, rdest Add Signed 

rdest <— srcl -\- src2 
OF <^ (bit 31 carry ^ bit 30 carry) 
CC set if src2 < comp2(src1) (signed) 
CC clear if src2 ^ comp2(src1) (signed) 

addu src1, src2, rdest Add Unsigned 

rdest <— srd + src2 
OF '^ bit 31 carry 
CC "^ bit 31 carry 

and src1, src2, rdest Logical AND 

rdest <r— src1 and src2 

> CC set if result is zero, cleared othen/vlse 

andii * const, src2, rdest Logical AND High 

rdest <r-' (#c70/7sf shifted left 16 bits) and src2 

> CC set if result is zero, cleared otherwise 

andnot srcl, src2, rdest Logical AND NOT 

rdest <r- not srcl and src2 

> CC set if result is zero, cleared otherwise 

andnoth * const, src2, rdest Logical AND NOT High 

rdest <r- not (#co/7s^ shifted left 16 bits) and src2 

> CC set if result is zero, cleared othen/vise 

be Ibroff Branch on CC 

IF CC = 1 

THEN continue execution at brx(lbroff) 

Fl 

bet Ibroff Branch on CC, Taken 

IF CC = 1 

THEN execut one more sequential instruction 

continue execution at brx(lbroff) 
ELSE skip next sequential instruction 
Fl 

bla srdni, src2, sbroff Branch on LCC and Add 

LCC-temp clear if src2 < comp2(src1ni) (signed) 

LCC-temp set if src2 ^ comp2(src1ni) (signed) 
src2 <— srdni + src2 
Execute one more sequential instruction 
IF LCC 

THEN LCC <r- LCC-temp 

continue execution at brx(sbroff) 
ELSE LCC ^ LCC-temp 
Fl 

bnc Ibroff Branch on Not CC 

IF CC = 

THEN continue execution at brx(lbroff) 

Fl 

bnc.t Ibroff Branch on Not CC, Taken 

IF CC = 

THEN execute one more sequential instruction 

continue execution at brx(lbroff) 
ELSE skip next sequential instruction 
Fl 

br Ibroff Branch Direct Unconditionally 

Execute one more sequential instruction. 
Continue execution at brx(lbroff). 
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bri Isrclni] Branch indirect unconditionally 

Execute one more sequential instruction 
IF any trap bit in psr is set 

THEN copy PU to U. PIM to IM in psr 
clear trap bits 

IF DS is set and DIM is reset 

THEN enter dual-instruction mode after executing one 

instruction in single-instruction mode 
ELSE IF DS is set and DIM is set 

THEN enter single-instruction mode after executing one 

instruction in dual-instruction mode 
ELSE IF DIM is set 

THEN enter dual-instruction mode 
for next two instructions 
ELSE enter single-instruction mode 

for next two instructions 
Fl 
Fl 
Fl 
Fl 

Continue execution at address in s/r/A7/ 
(The original contents of srdni is used even if the next instruction 
modifies src/;?/. Does not trap if src/A7/ is misaligned.) 

bte srds, src2, sbroff Branch If Equal 

IF srds = src2 

THEN continue execution at brx(sbroff) 

Fl 

btne srds, src2, sbroff Branch If Not Equal 

IF srds ^ src2 

THEN continue execution at brx(sbroff) 

Fl 

call Ibroff Subroutine Call 

r1 <r- address of next sequential instruction + 4 
Execute one more sequential instruction 
Continue execution at brx(lbroff) 

calli [srdni] Indirect Subroutine Call 

r1 <r- address of next sequential instruction + 4 
Execute one more sequential instruction 
Continue execution at address in srdni 

(The original contents of srdni \s used even if the next instruction 

modifies srdni. Does not trap if srdni is misaligned.) 

fadd.p srd, src2, rdest Floating-Point Add 

rdest <— srd -\- src2 

faddp srd, src2, rdest Add with Pixel Merge 

rdest <— srd + src2 

Shift and load MERGE register as defined in Table 8.1 

faddz srd, src2, rdest .Add with Z Merge 

rdest <— srd + src2 

Shift MERGE right 16 and load fields 31..16 and 63..48 

fladd.w srd, src2, rdest Long-Integer Add 

rdest <^ srd + src2 

fisub.w srd, src2, rdest Long-Integer Subtract 

rdest -^r— srd - src2 

f Ix.p srd, rdest Floating-Point to Integer Conversion 

rdest <— 64- bit value with low-order 32 bits equal to integer part of srd rounded 

Floating-Point Load 
fld.y srd(src2), freg (Normal) 
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fid.y src1(src2)+ + , freg (Autoincrement) 

freg <r- mem.y (src1 + src2) 

IF autoincrement 

THEN src2 <— src1 + src2 

Fl 

Cache Flush 

flush if= const(src2) (Normal) 

flush #const{src2) + + (Autoincrement) 

Replace block in data cache with address (# const + src2). 

Contents of block undefined. 

IF autoincrement 

THEN src2 <r- #const + src2 

Fl 

fmlow.p srcl, src2, rdest Floating-Point Multiply Low 

rdest <^ low-order 53 bits of src1 mantissa X src2 mantissa 
rdest bit 53 <— most significant bit of mantissa 

fmov.r src1, rdest Floating-Point Reg-Reg Move 

Assembler pseudo-operation 

fmov.ss src1, rdest = fiadd.ss srcl, fO, rdest 
fmov.dd srcl, rdest = fiadd.dd srcl, fO, rdest 
fmov.sd srcl, rdest = fadd. sd src1, fO, rdest 
fmov.ds src1, rdest = fadd.ds srcl, fO, rdest 

fmul.p srcl, src2, rdest Floating-Point Multiply 

rdest <— srcl x src2 

f nop Floating-Point No Operation 

Assembler pseudo-operation 
fnop = shrd rO, rO, rO 

form srcl, rdest OR with MERGE Register 

rdest <r-- srcl OR MERGE 
MERGE <r- 

f rcp.p src2, rdest Floating-Point Reciprocal 

rdest -<— 1 /src2 with maximum mantissa error < 2-7 
f rsqr.p src2, rdest Floating-Point Reciprocal Square Root 

rdest <— 1 /SORT (src2) with maximum mantissa error < 2-7 

Floating-Point Store 

fst.y freg, src1(src2) (Normal) 

fst.y freg, src1(src2) + + (Autoincrement) 

mem.y (src2 + src1) <r- freg 

IF autoincrement 

THEN src2 <^ src1 + src2 

Fl 
fsub.p src1, src2, rdest Floating-Point Subtract 

rdest <— srcl - src2 
ftrunc.p srcl, rdest Floating-Point to Integer Conversion 

rdest <— 64-bit value with low-order 32 bits equal to integer part of srcl 
fxfr srcl, freg Transfer F-P to Integer Register 

freg <— srcl 
fzchkl srcl, src2, rdest 32-Bit Z-Buffer Check 

Consider srcl, src2, and rdest as arrays of two 32-bit 

fields src1(0)..src1(1), src2(0)..src2(1), and rdest(0)..rdest(A) 
where zero denotes the least-significant field. 

PM <^ PM shifted right by 2 bits 

FOR i = to 1 

DO 

PM [i + 6] <— src2{\) <. srclij) (unsigned) 
rdest(\) '<— smaller of src2(\) and src1(\) 

OD 

MERGE <- 

60 



IrrteT 



i860TM MICROPROCESSOR 



MMM\©E DKIF©[^IM]MD@M 



fzchks srcl, src2, rdest 16-Bit Z-Buffer Check 

Consider src1, src2, and rdest as arrays of four 16-bit 

fields src1(0)..src1{3), src2(0)..src2(3), and rdest(0)..rdest(3) 

wiiere zero denotes the least-significant field. 
PM <— PM shifted right by 4 bits 
FOR i = to 3 
DO 

PM [i + 4] ^^— src2{\) ^ srcHj) (unsigned) 

rdesti}) '^— smaller of src2(\) and src1(\) 
OD 
MERGE <- 

intovr Software Trap on integer Overt iow 

If OF in epsr = 1 , generate trap with IT set in psr. 

ixfr srclnl, freg — Transfer integer to F-P Register 

freg <— srdni 

id.c ctrlreg, rdest Load from Controi Register 

rdest <^ ctrlreg 

id.x src1(src2), rdest Load integer 

rdest ^r— mem.x(src1 + src2) 

iock Begin interiocked Sequence 

Set BL in dirbase. The next load or store locks the bus. 
Disable interrupts until the bus is unlocked. 

mov src2, rdest Register-Register i\/iove 

Assembler pseudo-operation 

mov src2, rdest = shi rO, src2, rdest 

nop Core-Unit No Operation 

Assembler pseudo-operation 
nop = shi rO, rO, rO 

or srcl, src2, rdest Logical OR 

rdest "^— srcl OR src2 

> CC set if result is zero, cleared otherwise 

orh * const, src2, rdest Logicai OR High 

rdest <— (#coA7Sf shifted left 16 bits) OR src2 

> CC set if result is zero, cleared otherwise 

pfadd.p srcl, src2, rdest Pipelined Fioating-Point Add 

rdest <r- last A-stage result 
Advance a pipeline one stage 
A pipeline first stage <— srcl + src2 

pfaddp srcl, src2, rdest Pipelined Add with Pixel Merge 

rdest <— last-stage l-result 

last stage l-result <— srcl -\- src2 

Shift and load MERGE register from last-stage l-result as defined in Table 8.1 

pfaddz srcl, src2, rdest Pipelined Add with Z l\/lerge 

rdest <— last-stage l-result 

last stage l-result -f— srcl + src2 

Shift MERGE right 16 and load fields 31. .16 and 63..48 from last-stage l-result 

pfam.p srcl, src2, rdest Pipelined Floating-Point Add and Multiply 

rdest <r- last A-stage result 

Advance A and M pipeline one stage (operands accessed before advancing pipeline) 

A pipeline first stage '^— A-opI + A-op2 

M pipeline first stage <— M-opI x M-op2 
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pfeq.p src1, src2, rdest Pipelined Floating-Point Equal Compare 

rdest <— last A-stage result 

CO set if src1 = src2, else cleared 

Advance A pipeline one stage 

A pipeline first stage is undefined, but no result exception occurs 

pfgtp srcl, src2, rdest Pipelined Floating-Point Greather-Than Compare 

(Assembler clears R-bit of instruction) 

rdest <— last A-stage result 

CO set if src1 > src2, else cleared 

Advance A pipeline one stage 

A pipeline first stage is undefined, but no result exception occurs 

pfiadd.w src1, src2, rdest Pipelined Long-Integer Add 

rdest <— last-stage l-result 
last-stage l-result <— srd -\- src2 

pfisub.w srd, src2, rdest Pipelined Long-Integer Subtract 

rdest <— last-stage l-result 
last-stage l-result <^ srcl - src2 

pfix.p srd, rdest Pipelined Floating-Point to Integer Conversion 

rdest <— last A-stage result 
Advance A pipeline one stage 

A pipeline first stage <— 64-bit value with low-order 32 bits 
equal to integer part of srd rounded 

Pipelined Floating-Point Load 

pfld.2 srd(src2), freg (Normal) 

pfld.z srd(src2)-\- + , freg (Autoincrement) 

freg <— mem.z (third previous pfld's (srd + src2)) 

(where .z is precision of third previous pfld.z) 
If autoincrement 
THEN src2 <r- srd -^ src2 
Fl 

pfle.p srd, src2, rdest Pipelined F-P Less-Than or Equal Compare 

Assembler pseudo-operation, identical to pfgt.p except that 

assembler sets R-bit of instruction. 
rdest *<— last A-stage result 
CO clear if srd ^ src2, else set 
Advance A pipeline one stage 
A pipeline first stage is undefined, but no result exception occurs 

pfmam.p srd, src2, rdest Pipelined Floating-Point Add and l\/lultiply 

rdest <— last M-stage result 

Advance A and M pipeline one stage (operands accessed before advancing pipeline) 

A pipeline first stage <— A-op1 - A-op2 

M pipeline first stage <<— M-op1 x M-op2 

pfmov.r srd, rdest Pipelined Floating-Point Reg-Reg IVIove 

Assembler pseudo-operation 

pfmov.ss srd, rdest = pfiadd.ss srd, fO, rdest 
pfmov.dd srd, rdest = pfiadd.dd srd, fO, rdest 
pfmov.sd srd, rdest = pfadd.sd srd, fO, rdest 
pfmov.ds srd, rdest = pfadd.ds srd, fO, rdest 

pfmsm.p srd, src2, rdest Pipelined Floating-Point Subtract and Multiply 

rdest <— last M-stage result 

Advance A and M pipeline one stage (operands accessed before advancing pipeline) 

A pipeline first stage <—■ A-op1 - A-op2 

M pipeline first stage <— M-op1 x M-op2 

pfmul.p srd, src2, rdest Pipelined Floating-Point Multiply 

rdest <— last M-stage result 

Advance M pipeline one stage 

M pipeline first stage ^<— srd x src2 
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pfmulS.p src1, src2, rdest Three-Stage Pipelined l\/lultiply 

rdest <— last M-stage result 
Advance 3-Stage M pipeline one stage 
M pipeline first stage <— srcl x src2 

pform src1, rdest Pipelined OR to MERGE Register 

rdest <r- last-stage l-result 

last stage l-result <— srcl OR MERGE 

MERGE <r- 

pfsm.p srcl, src2, rdest Pipelined Floating-Point Subtract and Multiply 

rdest <— last A-stage result 

Advance A and M pipeline one stage (operands accessed before advancing pipeline) 

A pipeline first stage <— A-op1 - A-op2 

M pipeline first stage ^~ M-op1 x M-op2 

pfsub.p src1, src2, rdest Pipelined Floating-Point Subtract 

rdest <r- last A-stage result 

Advance A pipeline one stage 

A pipeline first stage <^ src1 + src2 

pftruncp src1, rdest Pipelined Floating-Point to Integer Conversion 

rdest ^^— last A-stage result 
Advance A pipeline one stage 

A pipeline first stage -<— 64-bit value with low-order 32 bits 
equal to integer part of srcl 

pfzchkl srcl, src2, rdest Pipelined 32-Bit Z-Buffer Check 

Consider srcl, src2, and rdest, as arrays of two 32-bit 

fields src/(0)..src/(1), src2{!S)..src2{y\ and rdest(0)..rdest(A) 

where zero denotes the least significant field. 
PM <r- PM shifted right by 2 bits 
FOR i = to 1 
DO 

PM [1 + 6] -^-^ src2{\) < src1{\) (unsigned) 

rdesti}) <^ last-stage l-result 

last-stage l-result <— smaller of src2(\) and src1(\) 
OD 
MERGE ^ 

pfzchks srcl, src2, rdest Pipelined 16-Blt Z-Buffer Check 

Consider srcl, src2, and rdest, as arrays of four 16-bit 

fields src1(0)..src1(3), src2(0)..src2{3), and rdest(0)..rdest{3) 

where zero denotes the least significant field. 
PM <— PM shifted right by 4 bits 
FOR i = to 3 
DO 

PM [i + 4] -^r- src2(\) ^ src1(\) (unsigned) 

rdest(\) <— last-stage l-result 

last-stage l-result ««— smaller of src2(\) and src1(i) 
OD 
MERGE ^0 

pst.d freg, *const(src2) .Pixel Store 

P8t.d freg, *cor)st(src2)-v + Pixel Store Autoincrement 

Pixels enabled by PM in mem.D {src2 + * const) "<— freg 

Shift PM right by 8/plxel size (in bytes) bits 

IF autoincrement THEN src2 <— * const + src2 Fl 

8hl srdd, src2, rdest Shift Left 

rdest <— src2 shifted left by src1 bits 

shr src1, src2, rdest Shift Right 

SC (in psr) <— src1 

rdest ^r- src2 shifted right by src1 bits 
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shra src1, src2, rdest Shift Right Arithmetic 

rdest <— src2 arithmetically shifted right by src1 bits 

shrd src1, src2, rdest Shift Right Double 

rdest <— low-order 32 bits of src1:src2 shifted right by SC bits 

st.c srdni, ctrlreg Store to Control Register 

ctrlreg ^«— srdni 

st.x srdni, *const(src2) Store Integer 

mem.x (src2 ^- * const) <— srdni 

subs srd, src2, rdest Subtract Signed 

rdest <— srd - src2 
OF <— (bit 31 carry ¥= bit 30 carry) 
CC set if src2 > srd (signed) 
CC clear if src2 ^ srd (signed) 

subu srd, src2, rdest Subtract Unsigned 

rdest <— srd - src2 
OF ^ NOT (bit 31 carry) 
CC <— bit 31 carry 
(i.e. CC set if src2 <. srd (unsigned) 
CC clear if src2 > srd (unsigned) 

trap srd, src2, rdest Software Trap 

Generate trap with IT set in psr 

unlock End Interlocked Sequence 

Clear BL in dirbase. The next load or store unlocks the bus. 

xor srd, src2, rdest Logical Exclusive OR 

rdest <— srd XOR src2 

CC set if result is zero, cleared otherwise 

xorh # const, src2, rdest Logical Exclusive OR High 

rdest <— (#coA7sf shifted left 16 bit) XOR src2 
CC set if result is zero, cleared otherwise 
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Table 8.1. FADDP MERGE Update 



Pixei 

Size 

(from PS) 


Fieids Loaded From 
Resuit into i\/IERGE 


Right Shift 

Amount 

(Field Size) 


8 
16 
32 


63..56. 47..40, 31..24, 15..8 
63..58, 47..42, 31..26, 15..10 
63.56, 31. .24 


8 
6 
8 



8.2 Instruction Format and Encoding 

All instructions are one word long and begin on a 
word boundary. There are two general core-instruc- 
tion formats: REG-format and CTRL-format. Within 
the REG-format are several variations. 



8.2.1 REG-FORIViAT INSTRUCTIONS 

The src2 field selects one of the 32 integer registers 
(most Instructions) or five control registers (st.c and 
Id.c). Dest selects one of the 32 integer registers 
(most instructions) or floating-point registers (fid, 
fst, pfid, pst, ixfr). For instructions where src1 is 
optionally an immediate value, bit 26 of the opcode 
(1-bit) indicates whether src1 is an immediate. If bit 
26 is clear, an integer register is used; if bit 26 is set, 
src1 is contained in the low-order 16 bits, except for 
bte and btne instructions. For bte and btne, the 
five-bit immediate value is contained in the src1 
field. For st, bte, btne, and bla, the upper five bits of 
the offset or broffset are contained in the dest field 



instead of src1, and the lower 11 bits of offset are 
the lower 11 bits of the instruction. 

For Id and st, bits 28 and zero determine operand 
size as follows: 



Bit 28 


BitO 


Operand Size 





1 

1 




1 


1 


8-bits 

8-bits 

16-blts 

32-bits 



When src1 is an immediate and bit 28 is set, bit zero 
of the immediate value is forced to zero. 

For fid, fst, pfid, pst, and flush, bit selects autoin- 
crement addressing if set. Bits one and two select 
the operand size as follows: 



Biti 


Bit 2 


Operand Size 




1 
1 



1 

1 


64-bits 

128-bits 

32-bits 

32-bits 



When src1 is an immediate value, bits zero and one 
of the immediate value are forced to zero to main- 
tain alignment. When bit one of the immediate value 
is clear, bit two is also forced to zero. 





31 25 


General Format 

20 15 10 






OPCODE/I 


SRC2 


DEST 


SRC1 


IMMEDIATE/OFFSET, OR NULL 






16-Bit Immediate Variant (except bte and btne) 

31 25 20 15 






OPCODE 


1 


SRC2 


DEST 


IMMEDIATE 






31 25 


St, bla, bte, and btne 

20 15 10 






OPCODE/I 


SRC2 


OFFSET 
HIGH 


SRC1 
SRC1S 


OFFSET LOW 






31 25 


bte and btne with 5-Bit Immediate 

20 15 10 






OPCODE 


1 


SRC2 


OFFSET 
HIGH 


IMMEDIATE 


OFFSET LOW 
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8.2.2 REG-FORMAT OPCODES 







31 










26 


ld.x 


Load Integer 











L 





1 


st.x 


Store Integer 











L 


1 


1 


ixfr 


Integer to F-P Reg Transfer 














1 







(reserved) 











1 


1 





f ld.x, fst.x 


Load/Store F-P 








1 





LS 


1 


flush 


Flush 








1 


1 





1 


pst.d 


Pixel Store 








1 


1 


1 


1 


Id.c, st.c 


Load/Store Control Register 








1 


1 


LS 





bri 


Branch Indirect 



















trap 


Trap 
















1 




(Escape for F-P Unit) 













1 







(Escape for Core Unit) 













1 


1 


bte, btne 


Branch Equal or Not Equal 










1 


E 


1 


pfld.y 


Pipelined F-P Load 







1 








1 




(CTRL-Format Instructions) 







1 


X 


X 


X 


addu, -s, subu, -s, 


Add/Subtract 










so 


AS 


1 


shI, shr 


Logical Shift 







1 





LR 


1 


shrd 


Double Shift 







1 


1 








bla 


Branch LCC Set and Add 







1 


1 





1 


shra 


Arithmetic Shift 







1 


1 


1 


1 


and(h) 


AND 












H 


1 


andnot(h) 


ANDNOT 









1 


H 


1 


or(h) 


OR 






1 





H 


1 


xor(h) 


XOR 






1 


1 


H 


1 




(reserved) 






X 


X 


1 






L Integer Length 

—8 bits 

1 —16 or 32 bits (selected by bit 0) 
LS Load/Store 

—Load 

1 —Store 
SO Signed/Ordinal 

—Ordinal 

1 — Signed 
H High 

—and, or, andnot, xor 

1 — andh, orh, andnoth, xorh 



AS Add/Subtract 

—Add 

1 —Subtract 
LR Left/Right 

—Left Shift 

1 —Right Shift 
E Equal 

— Branch on Not Equal 

1 —Branch on Equal 
I Immediate 

—src1 is register 

1 — src1 is immediate 



8.2.3 CORE ESCAPE INSTRUCTIONS 
















31 26 


15 


10 




5 






10 11 


reserved 


SRC1 


reserved 


OPCODE 
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8.2.4 CORE ESCAPE OPCODES 







4 













(reserved) 

















lock 


Begin Interlocked Sequence 














1 


calli 


Indirect Subroutine Call 











1 







(reserved) 











1 


1 


intovr 


Trap on Integer Overflow 








1 










(reserved) 








1 





1 




(reserved) 








1 


1 





unlock 


End Interlocked Sequence 








1 


1 


1 




(reserved) 





1 


X 


X 


X 




(reserved) 


1 





X 


X 


X 




(reserved) 


1 


1 


X 


X 


X 



8.2.5 CTRL-FORMAT INSTRUCTIONS 



31 



28 



25 






1 


1 


OPC 


BROFFSET 



BROFFSET is a signed 26-bit relative branch offset. 



8.2.6 CTRL-FORMAT OPCODES 



28 



T Taken 

— be or bnc 

1 — bet or bnct 



26 



br Branch Direct 
call Call 
bc(.t) Branch on CO Set 
bnc(.t) Branch on CO Clear 





1 
1 


1 
1 

1 




1 

T 
T 



8.2.7 FLOATING-POINT INSTRUCTION ENCODING 






















31 25 20 15 




7 











10 10 


SRC2 


DEST 


SRC1 


P 


D 


s 


R 


OPCODE 



















SRC1, SRC2 —Source; one of 32 floating-point registers 
DEST — Destination register 

(instructions other than fxfr) one of 32 floating-point registers 

(fxfr) one of 32 integer registers 



Pipelining 

1 — Pipelined instruction mode 

— Scalar Instruction mode 
Dual-Instruction Mode 

1 — Dual-Instruction mode 
— Single-instruction mode 



Source Precision 

1 — Double-precision source operands 

— Single-precision source operands 
Result Precision 

1 — Double-precision result 
— Single-precision result 
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8.2.8 FLOATING-POINT OPCODES 







6 















pfam 


Add and Multiply* 














pfmam 


Multiply with Add* 













DPC 




pfsm 


Subtract and Multiply* 





n 


1 




npn 




pfmsm 


Multiply with Subtract* 


u 


1 




Ur\j 




(p)fmul 


Multiply 






















fmlow 


Multiply Low 



















1 


frcp 


Reciprocal 
















1 





frsqr 


Reciprocal Square Root 
















1 


1 


(p)fadd 


Add 







1 














(p)fsub 


Subtract 







1 











1 


(p)fix 


Fix 







1 








1 





pfgt/pfle** 


Greater Than 







1 





1 








pfeq 


Equal 







1 





1 





1 


(p)ftrunc 


Truncate 







1 


1 





1 





fxfr 


Transfer to Integer Register 






















(p)fiadd 


Long-Integer Add 










1 








1 


(p)fisub 


Long-Integer Subtract 










1 


1 





1 


(p)fzchkl 


Z-Check Long 







1 





1 


1 


1 


(p)fzchks 


Z-Check Short 







1 


1 


1 


1 


1 


(p)faddp 


Add with Pixel Merge 







1 














(p)faddz 


Add with Z Merge 







1 











1 


(p)form 


OR with MERGE Register 







1 


1 





1 






*pfam and pfsm have P-bit set; pfmam and pfmsm have P-bit clear. 
**pfgt has R bit cleared; pfle has R bit set. 
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The following table shows the opcode mnemonics that generate the various encodings of DPC and explains 
each encoding. 









Table 8.2. DPC Encoding 








DPC 


PFAM 


PFSM 


M-Unit 


M-UnIt 


A-Unit 


A-UnIt 


T 


K 


Mnemonic 


Mnemonic 


opi 


op2 


op1 


op2 


Load 


Load* 


0000 


r2p1 


r2s1 


KR 


src2 


srcl 


M result 


No 


No 


0001 


r2pt 


r2st 


KR 


src2 


T 


M result 


No 


Yes 


0010 


r2ap1 


r2as1 


KR 


src2 


srcl 


A result 


Yes 


No 


0011 


r2apt 


r2ast 


KR 


src2 


T 


A result 


Yes 


Yes 


0100 


i2p1 


i2s1 


Kl 


src2 


srcl 


M result 


No 


No 


0101 


i2pt 


i2st 


Kl 


src2 


T 


M result 


No 


Yes 


0110 


i2ap1 


i2as1 


Kl 


src2 


srcl 


A result 


Yes 


No 


0111 


i2apt 


i2ast 


Kl 


src2 


T 


A result 


Yes 


Yes 


1000 


rat1p2 


rat1s2 


KR 


A result 


srcl 


src2 


Yes 


No 


1001 


m12apm 


m12asm 


srcl 


src2 


A result 


M result 


No 


No 


1010 


ra1p2 


ra1s2 


KR 


A result 


srcl 


src2 


No 


No 


1011 


m12ttpa 


m12ttsa 


srcl 


src2 


T 


A result 


Yes 


No 


1100 


iat1p2 


iat1s2 


Kl 


A result 


srcl 


src2 


Yes 


No 


1101 


m12tpm 


m12tsm 


srcl 


src2 


T 


M result 


No 


No 


1110 


ia1p2 


ia1s2 


Kl 


A result 


srcl 


src2 


No 


No 


1111 


m12tpa 


m12tsa 


srcl 


src2 


T 


A result 


No 


No 














DPC 


PFMAM 


PFMSM 


M-Unit 


M-UnIt 


A-Unit 


A-UnIt 


T 


K 


Mnemonic 


Mnemonic 


op1 


op2 


opi 


op2 


Load 


Load* 


0000 


mr2p1 


mr2s1 


KR 


src2 


srcl 


M result 


No 


No 


0001 


mr2pt 


mr2st 


KR 


src2 


T 


M result 


No 


Yes 


0010 


mr2mp1 


mr2ms1 


KR 


src2 


srcl 


M result 


Yes 


No 


0011 


mr2mpt 


mr2mst 


KR 


src2 


T 


M result 


Yes 


Yes 


0100 


mi2p1 


mi2s1 


Kl 


src2 


srcl 


M result 


No 


No 


0101 


mi2pt 


mi2st 


Kl 


src2 


T 


M result 


No 


Yes 


0110 


mi2mp1 


mi2ms1 


Kl 


src2 


srcl 


M result 


Yes 


No 


0111 


mi2mpt 


mi2mst 


Kl 


src2 


T 


M result 


Yes 


Yes 


1000 


mrmt1p2 


mrmt1s2 


KR 


M result 


srcl 


src2 


Yes 


No 


1001 


mm12mpm 


mm12msm 


srcl 


src2 


M result 


M result 


No 


No 


1010 


mrm1p2 


mrm1s2 


KR 


M result 


srcl 


src2 


No 


No 


1011 


mm12ttpm 


mm12ttsm 


srcl 


src2 


T 


A result 


Yes 


No 


1100 


mimt1p2 


mimt1s2 


Kl 


M result 


srcl 


src2 


Yes 


No 


1101 


mm12tpm 


mm12tsm 


srcl 


src2 


T 


M result 


No 


No 


1110 


mim1p2 


mim1s2 


Kl 


M result 


srcl 


src2 


No 


No 


1111 


mm12tpm 


mm12tsm 


srcl 


src2 


T 


M result 


No 


No 



*lf K-load is set, KR is loaded when operand-1 of the multiplier is KR; Kl is loaded when operand-1 of the multiplier Is Kl. 
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8.3 Instruction Timings 

860 microprocessor instructions take one clock to 
execute unless a freeze condition is invoked. Freeze 
conditions and their associated delays are shown in 



the table below. Freezes due to multiple simulta- 
neous cache misses result in a delay that is the sum 
of the delays for processing each miss by itself. Oth- 
er multiple freeze conditions usually add only the de- 
lay of the longest individual freeze. 



Freeze Condition 


Delay 


Instruction-cache miss 


Number of clocks to read instruction (from ADS 
clock to first READY # clock) plus time to last 
READY # of block when jump or freeze occurs 
during miss processing plus two clocks if data- 
cache being accessed when instruction-cache 
miss occurs. 


Reference to destination of load instruction that 
misses 


One plus number of clocks to read data (from 
ADS# clock to first READY# clock) minus number 
of instructions executed since load (not counting 
instruction that references load destination) 


fid miss 


One plus number of clocks from ADS# to first 
READY# 


call/calli/ixf r/fxf r/id.c/st.c and data cache miss 
processing in progress 


One plus number of clocks until first READY # 
returned 


Id/st/pf Id/f Id/fst and data cache miss 
processing in progress 


One plus number of clocks until last READY # 
returned 


Reference to destoi Id, call, calli, fxfr, or Id.c in 

the next instruction 


One clock 
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Freeze Condition 


Deiay 


Reference to destoi fid/pfid/ixfr in the next two 
instructions 


Two clocks in the first instruction; one in the 
second instruction 


bc/bnc/bct/bnc.t following fadd/fsub/pfeg/ 

pfgt 


One clock 


Srcl of multiplier operation refers to result of 
previous operation 


One clock 


Floating-point operation or fst and scalar 
operation in progress other than f rep or f rsqr 


If the scalar operation is fadd, fix, f miow, f mui.ss, 
f mui.sd, ftrunc, or fsub, three minus the number 
of instructions executed after the scalar operation. 
If the scalar operation is f mui.dd, four minus the 
number of instructions executed after it. Add one if 
the precision of the result of the previous scalar 
operation is different than that of the source. Add 
one if the floating-point operation Is pipelined and 
Its destination is not fO. If the sum of the above 
terms is negative, there is no delay. 


Multiplier operation preceded by a double- 
precision multiply 


One clock 


TLB miss 


Five plus the number of clocks to finish two reads 
plus the number of clocks to set A-bits (if 
necessary) 


pfid when three pfid's are outstanding 


One plus the number of clocks to return data from 
first pfid 


pf id hits in the data cache 


Two plus the number of clocks to finish all 
outstanding accesses 


Store pipe full (two internal plus outstanding bus 
cycles) and st/fst miss, id miss, or f iush with 
modified block 


One plus the number of clocks until READY # 
active on next write data 


Address pipe full (one Internal plus outstanding 
bus cycles) and id/f id/pifd/st/fst 


Number of clocks until next address can be issued 


id/fid following st/fst hit 


One clock 


Delayed branch not taken 


One clock 


Nondelayed branch taken 


One clock 


Branch indirect br 


One clock 


st.c 


Two clocks 


Result of graphics-unit instruction (other than 
fmov) used in next instruction when the next 
instruction is an adder- or multiplier-unit instruction 


One clock 


Result of graphics-unit instruction used in next 
instruction when the next instruction is a graphics- 
unit instruction 


One clock 


flush followed by fiusii 


Two clocks 


fst followed by pipelined floating-point operation 
that oven/vrites the register being stored 


One clock 
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8.4 Instruction Characteristics 

The following table lists some of the characteristics 
of each instruction. The characteristics are: 

• What processing unit executes the instruction. 
The codes for processing units are: 

A Floating-point adder unit 

E Core execution unit 

G Graphics (vector-integer) unit 

M Floating-point multiplier unit 

• Whether the instruction is pipelined or not. A P 
indicates that the instruction is pipelined. 

• Whether the instruction is a delayed branch in- 
struction. A D marks the delayed branches. 

• Whether the instruction changes the condition 
code CC. A CC marks those instructions that 
change CC. 

• Which faults can be caused by the instruction. 
The codes used for exceptions are: 

IT Instruction Fault 

SE Floating-Point Source Exception 

RE Floating-Point Result Exception, including 

overflow, underflow, inexact result 
DAT Data Access Fault 

The instruction access fault lAT and the interrupt 
trap IN are not shown in the table because they 
can occur for any instruction. 

• Performance notes. These comments regarding 
optimum performance are recommendations 
only. If these recommendations are not followed, 
the 860 microprocessor automatically waits the 
necessary number of clocks to satisfy internal 
hardware requirements. The following notes de- 
fine the numeric codes that appear in the instruc- 
tion table: 

1 . The following instruction should not be a con- 
ditional branch (be, bnc, bet, or bnc.t). 

2. The destination should not be a source oper- 
and of the next two instructions. 



3. A load should not directly follow a store that is 
expected to hit in the data cache. 

4. When the prior instruction is scalar, srcl 
should not be the same as the rdest of the 
prior operation. 

5. The freg should not reference the destination 
of the next instruction if that instruction is a 
pipelined floating-point operation. 

6. The destination should not be a source oper- 
and of the next instruction. 

7. When the prior operation is scalar and multipli- 
er opi is srcl, src2 should not be the same as 
the rdest of the prior operation. 

8. When the prior operation is scalar, src1 and 
src2 of the current operation should not be the 
same as rdest of the prior operation. 

Programming restrictions. These indicate combi- 
nations of conditions that must be avoided by 
programmers, assemblers, and compilers. The 
following notes define the alphabetic codes that 
appear in the instruction table: 

a. The sequential instruction following a delayed 
control-transfer instruction may not be another 
control-transfer instruction (except in the case 
of external interrupts), nor a trap instruction, 
nor the target of a control-transfer instruction. 

b. When using a bri to return from a trap handler, 
programmers should take care to prevent traps 
from occurring on that or on the next sequen- 
tial instruction. IM should be zero (interrupts 
disabled) when the bri is executed. 

c. If rdest is not zero, srcl must not be the same 
as rdest. 

d. When the multiplier opi is srcl, srcl must not 
be the same as rdest. 

e. If rdest is not zero, srcl and src2 must not be 
the same as rdest. 
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Instruction 


Execution 


Pipelined? 


Sets 


Faults 


Performance 


Programming 


Unit 


Delayed? 


CC? 


Notes 


Restrictions 


adds 


E 




cc 




1 




addu 


E 




CC 




1 




and 


E 




cc 








andh 


E 




cc 








andnot 


E 




cc 








andnoth 


E 




cc 








be 


E 












bet 


E 


D 








a 


bla 


E 


D 








a 


bnc 


E 












bnct 


E 


D 








a 


br 


E 


D 








a 


bri 


E 


D 








a.b 


bte 


E 












btne 


E 












call 


E 


D 






2 


a 


call! 


E 


D 






2 


a 


fadd.p 


A 






SE.RE 






faddp 


G 








8 




faddz 


G 








8 




fiadd.z 


G 








8 




fisub.z 


G 








8 




fix.p 


A 






SE,RE 






fld.y 


E 






DAT 


2,3 




flush 


E 












fmlow.p 


M 








4 




fmul.p 


M 






SE,RE 


4 




form 


G 








8 




frep.p 


M 






SE.RE 






frsqr.p 


M 






SE.RE 






fst.y 


E 






DAT 


5 




fsub.p 


A 






SE.RE 






ftrunc.p 


A 






SE.RE 






fxfr 


G 








6.8 




fzchkl 


G 








8 




fzchks 


G 








8 




Intovr 


E 






IT 






Ixfr 


E 








2 




ld.c 


E 












ld.x 


E 






DAT 


6 




or 


E 




cc 








orh 


E 




cc 
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Instruction 


Execution 


Pipelined? 


Sets 


Faults 


Performance 


Programming 


II I9I>I Uwllvl 1 


Unit 


Delayed? 


CC? 


Notes 


Restrictions 


pfadd.p 


A 


P 




SE.RE 






pfaddp 


G 


P 






8 


e 


pfaddz 


G 


P 






8 


e 


pfam.p 


A&M 


P 




SE, RE 


7 


d 


pfeq.p 


A 


P 


cc 


SE 


1 




pfgt.p 


A 


P 


CC 


SE 


1 




pfiadd.z 


G 


P 






8 


e 


pfisub.z 


G 


P 






8 


e 


pfix.p 


A 


P 




SE.RE 






pfld.z 


E 


P 






2 




pfmul.p 


M 


P 




SE.RE 


4 


c 


pform 


G 


P 






8 


e 


pfsm.p 


A&M 


P 




SE.RE 


7 


d 


pfsub.p 


A 


P 




SE.RE 






pftrunc.p 


A 


P 




SE.RE 






pfzchkl 


G 


P 






8 




pfzchks 


G 


P 






8 




pst.d 


E 






DAT 






shI 


E 












shr 


E 












shra 


E 












shrd 


E 












st.c 


E 












st.x 


E 






DAT 






subs 


E 




cc 




1 




subu 


E 




cc 




1 




trap 


E 






IT 






xor 


E 




cc 








xorh 


E 




cc 
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Suite 220 


Tel: (714) 835-9642 
TWX: 910-595-1114 


Dayton 45414 
Tel: (513) 890-5350 
TWX: 810-450-2528 


Suite 550 




Greenbelt 20770 


fflss-; 


Tel: (301) 441-1020 






tintel Corp.* 

25700 Science Park Dr., Suite 100 


2700 San Tomas Expressway 


MASSACHUSETTS 


2nd Floor 




Beachwood 44122 


Santa Clara 95051 


tintel Corp.* 


Tel: (216) 464-2736 
TWX: 810-427-9298 


Tei: (408) 986-8086 
TWX: 910-338-0255 


Westford Corp. Center 
3 Carlisle Road 


FAX: 408-727-2620 


2nd Floor 
Westford 01886 


OKLAHOMA 


COLORADO 


Tel: (508) 692-3222 
TWX: 710-343-6333 


tintel Corp. 
6801 N. Broadway 


tintel Corp. 

4445 Northpark Drive 




Suite 115 


MICHIGAN 


Oklahoma City 731 62 


Suite 100 






Colorado Springs 80907 
Tei: (719)5*14-6622 


707? Orchard Lake Road 

Suite 100 

West Bloomfield 48322 




OREGON 


tintel Corp.* 

650 S. Cherry St., Suite 915 

Denver 80222 


tintel Corp. 

15254 N.W. Greenbrier Parkway 

Building B 

Beaverton 97006 


Tel: (313) 851-8096 


Tel: (303) 321-8086 
TWX: 910-931-2289 


MINNESOTA 




Tel: (503) 645-8051 
TWX: 910-467-8741 




tintel Corp. 

3500 W. 80th St., Suite 360 


CONNECTICUT 






Bloomington 55431 
Tel: (612) 835-6722 
TWX: 910-576-2867 


PENNSYLVANIA 


tintel Corp. 

26 Mill Plain Road 




Kfp£:&»..A..™. 


2nd Floor 




Danbury 06811 
Tei: (203) 748-3130 
TWX: 710-456-1199 


MISSOURI 


Suite 230 




Tel: (215) 641-1000 
TWX: 510-661-2077 


tintel Corp. 

4203 Earth City Expressway 




FLORIDA 


Suite 131 






Earth City 63045 
Tei: (314) 291-1990 


Intel Corp.* 


tintel Corp. 

6363 N.W. 6th Way, Suite 100 

Ft. Lauderdale 33309 


400 Penn Center Blvd., Suite 610 




Pittsburgh 15235 
Tel: (412) 823-4970 


NEW JERSEY 


Tel: (305) 771-0600 
TWX: 510-956-9407 






tintel Corp.* 


PUERTO RICO 


FAX: 305-772-8193 








tintel Microprocessor Corp. 
South Industrial Park 


tintel Corp. 

S850 J.G.Lee Blvd. 


Red Bank 07701 


Tel: (201) 747-2233 


P.O. Box 910 


Suite 340 




Las Piedras 00671 


Orlando 32822 


tintel Corp. 

280 Corporate Center 


Tei: (809) 733-8616 


Tel: (407) 240-8000 
FAX: 407-240-8097 




75 Livingston Avenue 






First FIcrar 




11300 4th'street North 


Roseland 07068 




Tel: (201) 740-01 11 
FAX: 201-740-0626 




Suite 170 




St. Petersburg 3371 6 






Tel: (813) 577-2413 
FAX: 813-578-1607 











TEXAS 

tintel Corp. 

313 E. Anderson Lane 

Suite 314 

Austin 78752 

Tel: (512) 454-3628 

tintel Corp.* 
12000 Ford Road 
Suite 400 



tintel Corp.* 
7322 S.W. Freeway 
Suite 1490 
Houston 77074 
Tei: (713) 988-8086 
TWX: 910-881-2490 

UTAH 



Suite 104 
Murray 84107 
Tei: (801) 263-8051 

VIRGINIA 

tintel Corp. 
1504 Santa Rosa Road 
Suite 108 
Richmond 23288 
Tel: (804) 282-5668 

WASHINGTON 

tintel Corp. 
155 108th Avenue N.E. 
Suite 386 
Bellevue 98004 
Tel: (206) 453-8086 
TWX: 910-443-3002 



WISCONSIN 

tintel Corp. 
330 S. Executive Dr. 
Suite 102 
Brookfield 53005 
Tel; (414) 784-8087 
FAX: (414) 796-2115 

CANADA 

BRITISH COLUMBIA 

Intel Semiconductor of Canada, Ltd. 
4585 Canada Way, Suite 202 
Burnaby V5G 4L6 
Tel: (604) 298-0387 
FAX: (604) 298-8234 

ONTARIO 

tintel Semiconductor of Canada, Ltd. 

2650 Queensview Drive 

Suite 250 

Ottawa K2B 8H6 

Tel: (613) 829-9714 

TLX: 053-4115 

tintel Semiconductor of Canada, Ltd. 

190 Attwell Drive 

Suite 500 

RexdaleM9W6H8 

Tel: (416) 675-2105 



tintel Semiconductor of Canada, Ltd. 
620 St. John Boulevard 
Pointe Claire H9R3K2 



Tei: (514) 
TWX: 514 



inter 



DOMESTIC SERVICE OFFICES 



ALABAMA 


FLORIDA 


MICHIGAN 


NORTH CAROLINA 


TEXAS (Cont'd.) 


Intel Corp. 


Intel Corp. 

6363 N.W. 6th Way 


Intel Corp. 


Intel Corp. 


Intel Corp.* 


5015 Bradford Dr., #2 


7071 Orchard Lake Road 


5700 Executive Drive 


12000 Ford Road 




Suite 100 


Suite 100 


Suite 213 


Suite 400 


Tel: (205) 830-4010 


Ft. Lauderdale 33309 


West Bloomfield 48033 


Charlotte 28212 


Dallas 75234 




Tel: (305) 771-0600 
TWX: 510-956-9407 


Tel: (313) 851-8096 


Tel: (704) 568-8966 


Tel: (214) 241-8087 
FAX: 214-484-1180 


ARIZONA 








FAX: 305-772-8193 


MINNESOTA 


Intel Corp. 




Intel Corp. 








7322 S.W.' Freeway 


11225 N. 28th Dr. 


Intel Corp. 


Intel Corp. 


Suite 206 


SuiteD-214 


5850 T.G. Lee Blvd. 


3500 W. 80th St., Suite 360 




Suite 1490 


Phoenix 85029 


Suite 340 


Bloomington 55431 
Tel: (612) 835-6722 
TWX: 910-576-2867 


Tel: (919) 294-1541 


Houston 77074 


Tel: (602) 869-4980 


Orlando 32822 




Tel: (713) 988-8086 
TWX: 910-881-2490 




Tel: (305) 240-8000 
FAX: 305-240-8097 


Intel Corp. 


Intel Corp. 




2700 Wycliff Road 
Suite 102 




500 E. Fry Blvd., Suite M-1 5 




MISSOURI 


UTAH 


Sierra Vista 85635 


Intel Corp. 

11 300 4th Street North 




Raleigh 27607 
Tel: (919) 781-8022 




Tel: (602) 459-5010 


Intel Corp. 


Intel Corp. 




Suite 170 


4203 Earth City Expressway 




428 East 6400 South 


Intel Corp. 

1161 N. El Dorado Place 


St. Petersburg 33716 
Tel: (813) 577-2413 
FAX: 813-578-1607 


Suite 131 


OHIO 


Suite 104 


Earth City 63045 




Murray 84107 
Tel: (801) 263-8051 


Suite 301 


Tel: (314) 291-1990 


Intel Corp.* 


Tucson 85715 






3401 Park Center Drive 




Tel: (602) 299-6815 


GEORGIA 


NEW JERSEY 


Suite 220 
Dayton 45414 
Tel: (513) 890-5350 
TWX: 810-450-2528 


VIRGINIA 


CALIFORNIA 


Intel Corp. 


Intel Corp. 
Raritan Plaza III 


Intel Corp. 




3280 Pointe Parkway 


1504 Santa Rosa Road 


Intel Corp. 


Suite 200 


Raritan Center 




Suite 108 


21515 Vanowen Street 


Norcross 30092 


Edison 08817 


Intel Corp.* 


Richmond 23288 


Suite 116 


Tel: (404) 449-0541 


Tel: (201) 225-3000 


25700 Science Park Dr. 


Tel: (804) 282-5668 


Canoga Park 91303 
Tel: (818) 704-8500 






Suite 100 




ILLINOIS 


Intel Corp. 


Beachwood 44122 


WASHINGTON 






385 Sylvan Avenue 


Tel: (216) 464-2736 
TWX: 810-427-9298 




Intel Corp. 


Intel Corp.* 


Englewood Cliffs 07632 
Tel: (201) 567-0821 
TWX: 710-991-8593 


Intel Corp. 

155 108th Avenue N.E. 


2250 E. Imperial Highway 




Suite 218 


Suite 400 


OKLAHOMA 


Suite 386 


El Segundo 90245 
Tel: (213) 640-6040 


Schaumburg 60173 
Tel:(312)3f0-8031 






Bellevue 98004 


Intel Corp.* 


Intel Corp. 

6801 N. Broadway 


Tel: (206) 453-8086 
TWX: 910-443-3002 






Parkway 109 Office Center 


Intel Corp. 


INDIANA 


328 Newman Springs Road 


Suite 115 




1900 Prairie City Rd. 
Folsom 95630-9597 




Red Bank 07701 


Oklahoma City 731 62 


Intel Corp. 

408 N. Mullan Road 


Intel Corp. 


Tel: (201) 747-2233 


Tel: (405) 848-8086 


Tel: (916) 351-6143 


8777 Purdue Road 






Suite 102 




Suite 125 


tlntel Corp. 

280 Corporate Center 


OREGON 


Spokane 99206 


Intel Corp. 

1510 Arden Way, Suite 101 

Sacramento 9581 5 


Indianapolis 46268 
Tel: (317) 875-0623 




Tel: (509) 928-8086 


75 Livingston Avenue 


Intel Corp. 






First Floor 


WISCONSIN 


Tel: (916) 920-8096 


IOWA 


Roseland 07068 
Tel: (201) 740-0111 
FAX: 201-740-0626 


Building B 
Beaverfon 97006 


Intel Corp. 


Intel Corp. 


Intel Corp. 


Tel: (503) 645-8051 
TWX: 910-467-8741 


330 S. Executive Dr. 


4350 Executive Drive 


1930 St. Andrews Drive N.E. 




Suite 102 


Suite 105 


2nd Floor 


NEW MEXICO 




Brookfield 53005 


San Diego 92121 
Tel: (619) 452-5880 


Cedar Rapids 52402 




Intel Corp. 

5200 N.e.Elam Young Parkway 


Tel: (414) 784-8087 


Tel: (319) 393-5510 


Intel Corp. 


FAX: (414)796-2115 






8500 Menaul Boulevard N.E. 


Hillsboro 97123 




Intel Corp.* 

400 N. Tustin Avenue 




Suite B 295 
Albuquerque 87112 
Tel: (S05) 292-8086 


Tel: (503) 681-8080 


CANADA 


Suite 450 


Intel Corp. 


PENNSYLVANIA 




Santa Ana 92705 


8400 W.I 10th Street 






BRITISH COLUMBIA 


Tel: (714) 835-9642 
TWX: 910-595-1114 


Suite 170 


NEW YORK 


Intel Corp.* 




Overland Park 66210 






Intel Semiconductor of Canada, Ltd. 




Tel: (913) 345-2727 


Intel Corp. 


Suite 230 


4585 Canada Way, Suite 202 
BurnabyV5G4L6 


Intel Corp.* 




127 Main Street 


Fort Washington 19034 
Tel: (215) 641-1000 
TWX: 510-661-2077 


San Tomas 4 


MARYLAND 




Tel: (604) 298-0387 


2700 San Tomas Expressway 




Tel: (607) 773-0337 
FAX: 607-723-2677 


FAX: (604) 298-8234 


2nd Floor 


Intel Corp.* 






Santa Clara 95051 


7321 Parkway Drive South 




Intel Corp.* 


ONTARIO 


Tel: (408) 986-8086 
TWX: 910-338-0255 


Suite C 


Intel Corp.* 


400 Penn Center Blvd. 




Hanover 21076 


860 Cross Keys Office Park 
Fairport 14450 


Suite 610 


Intel Semiconductor of Canada, Ltd. 




Tel: (301) 796-7500 
TWX: 710-862-1944 


Pittsburgh 15235 
Tel: (412) 823-4970 




COLORADO 


Tel: (716) 425-2750 
TWX: 510-253-7391 


Suite 250 








Ottawa K2B 8H6 


Intel Corp. 


Intel Corp. 

7833 Walker Drive 




PUERTO RICO 


Tel: (613) 829-9714 
TLX: 053-4115 


4445 Northpark Drive 


Intel Corp.* 




Suite 100 


Suite 550 


300 Motor Parkway 


Intel Microprocessor Corp. 




Colorado Springs 80907 
Tel: (303) 594-6622 




TeMlf6)%1-3300 
TWX: 510-227-6236 


South Industrial Park 


Intel Semiconductor of Canada. Ltd. 


Tel: (301) 441-1020 


P.O. Box 910 


190 Attwell Drive 






Las Piedras 00671 


Suite 500 


Intel Corp.* 

650 S.Cherry St., Suite 915 

Denver 8022^ 


MASSACHUSETTS 




Tel: (809) 733-8616 


RexdaleM9W6H8 




Intel Corp. 




Tel: (416) 675-2105 
TLX: 06983574 


Intel Corp.* 
Westford Corp. Center 


Westage Business Center 
BIdg. 300, Route 9 


TEXAS 


Tel: (303) 321 -8086 




FAX: (416) 675-2438 


TWX: 910-931-2289 


3 Carlisle Road 


Fishkill 12524 


Intel Corp. 






2nd Floor 


Tel: (914) 897-3860 
FAX: 914-897-3125 




QUEBEC 


CONNECTICUT 


Westford 01886 


Suite 314 






Tel: (508) 692-3222 
TWX: 710-343-6333 




Austin 78752 


Intel Semiconductor of Canada, Ltd. 


Intel Corp. 

26 Mill Plain Road 




Tel: (512) 454-3628 


620 St. John Boulevard 








Pointe Claire H9R 3K2 


2nd Floor 
Danbury 06811 
Tel: (203) 748-3130 
TWX: 710-456-1199 








Tel: (514) 694-9130 
TWX: 514-694-9134 



















CUSTOMER TRAINING CENTERS 



CALIFORNIA 

2700 San Tomas Expressway 
Santa Clara 95051 
Tel: (408) 970-1700 



300 N. Martingale, #300 
Schaumburg 60173 
Tel:(312)3f0-5700 



MASSACHUSETTS 

3 Carlisle Road 
Westford 01886 
Tel: (508) 692-1000 



7833 Walker Dr., 4th Floor 
Greenbelt 20770 
Tel: (301) 220-3380 



SYSTEMS ENGINEERING OFFICES 



300 N. Martingale, #300 
Schaumburg 60173 
Tel:(312)3f0-8031 



300 Motor Parkway 
Hauppauge 11788 
Tel: (516) 231-3300 



UNITED STATES 

Intel Corporation 

3065 Bowers Avenue 

Santa Clara, CA 95051 

JAPAN 

Intel Japan K.K. 

5-6 Tokodai, Tsukuba-shi 

Ibaraki, 300-26 

FRANCE 

Intel Corporation S.A.R.L. 

1, Rue Edison, BP 303 

78054 Saint-Quentin-en-Yvelines Cedex 

UNITED KINGDOM 

Intel Corporation (U.K.) Ltd. 

Pipers Way 

Swindon 

Wiltshire, England SN3 IRJ 

WEST GERMANY 

Intel Semiconductor GmbH 

Dornacher Strasse 1 

8016 Feldkirchen bei Muenchen 

HONG KONG 

Intel Semiconductor Ltd. 

10/F East Tower 

Bond Center 

Queensway, Central 

CANADA 

Intel Semiconductor of Canada, Ltd. 

190 Attwell Drive, Suite 500 

Rexdale, Ontario M9W 6H8 
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